ABSTRACT


This  investigation  was  an  attempt  to  assess  the  convergent  and 
discriminant validity of 3 key variables (influence, affect, and activity)
in small group research by conducting a systematic study which followed and
extended  the  Campbell  and  Fiske  multitrait-multimethod  paradigm.  64  paid, 
volunteer,  male  undergraduates  were  divided  into  18  3-  and  4-man  groups. 

During  four  weekly  2-hour  sessions  each  group  performed  a  series  of  comparable 
tasks,  under  varying  task-load  conditions  (normal,  overload,  and  underload). 
The  3  traits  were  measured  repeatedly  by  each  of  5  methods.  Analyses 
disclosed  widespread  method  variance  among  all  rating  methodologies.  The 
implications of the findings for small group research and for research in
social and psychological stress are discussed.


INTRODUCTION


Research  on  small  groups,  while  extremely  prolific  in  the  last 
three  decades,  has  produced  a  relatively  low  yield  of  cumulative 
empirical  information.  There  are  probably  a  number  of  reasons  for 
this  limited  yield  (cf.  McGrath  and  Altman,  1966).  Prominent  among 
these reasons are two distinct but related methodological problems:
(1) the lack of effort to establish construct validity of many key
variables and to eliminate method variance in the measurement of
these variables and (2) the lack of research on group process through
time. These two problems are the foci of the present research.

The first of these two problems is a measurement problem.
A major aspect of the validity of a measurement method depends upon
how well its measurements correlate with other independent procedures
used to measure the same trait (convergent validity). The validity
of a measurement method also depends upon how poorly its measurements
of one trait correlate with its measurements of other, different
traits (discriminant validity). Thus, a measurement method can be
invalidated either by (a) correlations with other methods of measuring
the same trait that are too low for convergence or (b) correlation
of the measurement of one trait with another trait that is too high for
discrimination between traits.

Without both convergent and discriminant validity of measurement,
it is not clear whether significant empirical findings are due to (a)
actual significant relationships (trait variance) or to (b) methodology
(method variance). What often happens is that different researchers
each use a different single method to measure a trait, and arrive at
irreconcilable results. And in the face of the variety of results one
cannot decide where trait variance and method variance have contributed
to the significant findings.

This problem arises in many domains of psychological research.
The experimental analysis of stress may be taken as a case in point.
Of it, Appley and Trumbull (1967) report, "Unfortunately, one investigator
relies on the GSR, a second on blood volume changes, a third on pulse
rate or heart rate, a fourth on muscle action potential, and so on.
In animals, the presence of stress is often inferred from feces counts,
trembling, 'freezing' or washing behavior, and so on" (p. 6). They
continue by citing the nearly incredible variety of findings which has
resulted.

The  second  of  the  two  problems,  the  lack  of  research  through 
time,  is  a  matter  of  the  feasibility  of  longitudinal  research.  One 
problem  has  been  the  difficulty  of  securing  subjects  for  repeated 
measurements  over  a  period  of  time.  Another  has  been  the  absence  of 
any  set  of  standardized  tasks  or  activities  which  could  be  used  over 
a  series  of  occasions. 

The  approach  here  taken  to  the  problem  of  convergence  of 
measures  and  the  elimination  of  method  variance  follows  and  extends 
the logic of the Campbell and Fiske (1959) multitrait-multimethod matrix.
The approach to the study of group process through time represents a
generalization  of  the  Campbell  and  Fiske  paradigm  to  include  the  third 
dimension  of  time-ordered  occasions.  In  effect,  the  present  study  is 
an  analysis  of  convergence  and  discrimination  among  a  set  of  behavior 
variables  (traits)  measured  by  each  of  a  set  of  methods,  for  the  members 
of each of a set of groups. Each measurement was replicated over a set
of situationally distinct and time-ordered occasions.

Chapter I begins with a review of the methods of measurement
which have been used in small group research and continues by reviewing
the relevant longitudinal studies of small groups. Where possible these
reviews concentrate on studies which have incorporated stress as an
independent variable. Campbell and Fiske's multitrait-multimethod
paradigm is then presented and extended to the present research.

A set of standardized tasks (He@§osen, 1966) has recently been
made available for social-psychological research of this type. These
are described briefly and their usefulness for longitudinal studies of
laboratory groups is noted. Chapter I concludes with a presentation of
the variables selected for the present study, and the basis for their
selection.

Experimental procedures occupy Chapter II. This includes
descriptions of the experimental subjects, procedures followed in
data-collection, and data-handling and scoring.

Results are presented in Chapter III. For each measurement method
used there are certain internal considerations (e.g., rater reliability,
test homogeneity) and these are presented first. Convergent and discriminant
validity of the measurement methods are then presented. Following this,
the effects of time on the subjects are considered. Chapter III concludes
with a presentation of the effects of stress on the behaviors of the
subjects and on the products written by the experimental groups.

Chapter IV is a discussion of the experimental findings.
The discussion emphasizes findings which would easily have been overlooked
or misinterpreted in smaller-scale, less systematic experimentation.

Chapter V is a discussion of the findings which emphasizes
the usefulness and potential of a systematic research strategy.

Supplementary forms, sample questionnaires, and some relevant
technical materials, referred to in the text, may be found in the
appendix.


CHAPTER I: BACKGROUND OF THE RESEARCH


A Survey of Methodologies
Frequently Used in the Study of Small Groups

A variety of methodologies is available to the researcher
who wishes to measure aspects of small group behavior. Among the
more common are questionnaires and rating forms administered to
group members, personality tests of various kinds, and observation
systems. A consideration of these frequently-used methods reveals
some weaknesses which deserve attention.

Ratings  and  Questionnaires 

Rating forms and questionnaires have long been a favorite
among social scientists. Webb, et al. (1966) note that such self
reports have been the dominant method of social science. And, while
there are some standard questionnaires available, it is most common
for each researcher to construct his own questionnaires and rating
forms for use in his own research. This has resulted in an
overdependence upon non-standardized ratings and questionnaires.

Some  of  the  weaknesses  of  ratings  and  questionnaires  are 
obvious.  First,  it  is  likely  that  the  subject  has  his  own  personal 
hypotheses  about  the  nature  and  intent  of  the  research  in  which  he 
is participating (Dulany, 1968). And he will complete the ratings
and questionnaires within the context of his private understanding
of the experimental situation. Thus, the rating or questionnaire
items may have a sharply different meaning for him than was intended
by the experimenter.

Second,  a  subject  enters  the  experimental  setting  with  his 
own  response-strategy.  The  possibilities  for  bias  here  run  all 
the  way  from  stylistic  "response  set"  biases  to  deliberate  strategies 
on  the  part  of  the  subjects.  Subjects  may  wish  to  cooperate  with  the 
experimenter  and  help  him  confirm  his  hypotheses,  they  may  wish  to 
oppose  the  experimenter's  interests  and  disconfirm  the  hypotheses,  or 
they  may  have  some  other  strategy  (e.g.,  appearing  "consistent"  over  a 
series  of  testing  occasions).  Whatever  the  strategy,  intentional  or 
unintentional,  it  may  introduce  a  systematic  but  unspecified  bias  into 
the  data. 

In  addition,  it  seems  likely  that  most  subjects  will  attempt 
to  present  themselves  as  favorably  as  possible.  The  concern  over 
social desirability (see Scott, 1963) points to this difficulty.
What "favorable" means to any particular subject may be difficult
to specify, but if such a strategy operates, then ratings may contain
systematic error from this source.

In his discussion of demand characteristics, Orne (1962) points
out that an experimental subject often tolerates a considerable degree
of boredom, discomfort, or actual pain during an experiment and therefore
has a stake in the study in which he is participating. It follows that
he must feel he has thereby made a useful contribution to scientific
research. In part, this involves viewing himself as a "good subject."
He will therefore behave in an experimental context in a manner designed
to play the role of a "good subject," responding to questionnaires in a
fashion designed to validate the experimental hypothesis as he sees it.

None  of  these  weaknesses  is  fatal  to  the  use  of  questionnaires 
and rating scales. In spite of the weaknesses, self reports remain
a most useful measurement method. But the weaknesses are sources
of possible systematic error which require checking against other methods.

Formal Tests

Formal tests are seldom used to measure group processes, as
such, but they are often used as screening devices or pre-tests on the
basis  of  which  subjects  may  be  grouped  together.  Among  the  usual 
advantages  ascribed  to  tests  are  (1)  they  make  it  more  difficult  for 
a  subject  to  be  deceptive,  (2)  they  are  better  standardized,  and  (3) 
their  validity  has  been  demonstrated  by  previous  research.  The  extent 
to  which  these  advantages  hold  for  any  particular  test  may  be  a  matter 
of  question,  but  they  are  features  worth  noting. 

One  drawback  of  using  tests  in  the  study  of  group  processes 
is  that  tests  are  relatively  inflexible  with  regard  to  the  experimental 
situation. Behind most formal personality tests stands a set of
assumptions about the traits being measured. Most often, it is assumed that
the  traits  are  relatively  stable  features  which  abide  inside  the 
person.  It  would  be  inconsistent  with  these  assumptions  to  test 
repeatedly  the  same  subjects  on  the  same  test,  except  for  purposes  of 
checking  reliability.  Changes  in  a  subject’s  responses  are  typically 
viewed  as  a  defect  in  the  test,  rather  than  as  a  change  in  the  subject. 

An advance in this area may be seen in the work of Leary (1957)
and  Schutz  (1958,  1961)  on  interpersonal  theories  of  personality. 
Interpersonal  personality  theories  focus  on  interactions  between  people, 
rather  than  on  inherent  attributes  of  the  person.  Both  Leary’s  and 
Schutz’  theories  have  yielded  personality  tests  which  may  provide 
sufficient  flexibility  for  use  in  interpersonal  situations. 

Observational  Systems 

Several observational systems have been developed for use in
small group research (Bales, 1950; Lippitt, 1949; Murphy, 1937;
Parten, 1933). The best-known and most widely used of these has
been Bales' Interaction Process Analysis (IPA), which we shall
consider briefly.

In the Bales system, verbal interactions are said to fall into
twelve categories: Shows Solidarity, Shows Tension Release, Agrees,
Gives Suggestion, Gives Opinion, Gives Orientation, Asks for
Orientation, Asks for Opinion, Asks for Suggestion, Disagrees, Shows
Tension, and Shows Antagonism. An observer, often hidden from the
group by one-way glass, monitors the verbal interaction and classifies,
for each subject separately, the verbal interactions. The result is
a time-ordered tabulation of each verbal unit, categorized by the twelve
categories and by the speaker. By analyzing the frequencies for each
subject, something about roles within the group can be learned; and
by grouping the data into time intervals, group processes through time
may be studied.
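
The kind of tabulation described above can be sketched in a few lines
of Python. This is only an illustrative sketch, not the recording
procedure of the Bales system itself: the record layout (seconds from
start, speaker, category), the speaker labels, the sample records, and
the function names are all hypothetical and are not taken from the
source.

```python
from collections import Counter

# The twelve IPA categories as listed in the text.
IPA_CATEGORIES = [
    "Shows Solidarity", "Shows Tension Release", "Agrees",
    "Gives Suggestion", "Gives Opinion", "Gives Orientation",
    "Asks for Orientation", "Asks for Opinion", "Asks for Suggestion",
    "Disagrees", "Shows Tension", "Shows Antagonism",
]

# Hypothetical observer record: (seconds from start, speaker, category).
records = [
    (12, "S1", "Gives Orientation"),
    (30, "S2", "Asks for Opinion"),
    (41, "S1", "Gives Opinion"),
    (75, "S3", "Agrees"),
    (140, "S2", "Disagrees"),
    (155, "S1", "Gives Opinion"),
]


def tally_by_speaker(records):
    """Count acts per (speaker, category), a rough picture of roles."""
    return Counter((speaker, category) for _, speaker, category in records)


def tally_by_interval(records, interval_seconds):
    """Count acts per (interval index, category), for process through time."""
    return Counter(
        (seconds // interval_seconds, category)
        for seconds, _, category in records
    )


by_speaker = tally_by_speaker(records)
by_interval = tally_by_interval(records, interval_seconds=60)

print(by_speaker[("S1", "Gives Opinion")])  # acts of this kind by S1
print(by_interval[(2, "Disagrees")])        # acts of this kind in the third minute
```

The per-speaker tally corresponds to the analysis of roles, and the
per-interval tally to the study of process through time. Neither tally
removes the observer's interpretation that went into each category
assignment.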

The  great  weakness  of  Interaction  Process  Analysis,  and  of 
other  similar  systems,  is  that  it  requires  much  interpretation  of 
behavior  by  the  observer  as  he  is  recording  it.  Verbal  behaviors 
do  not  fall  easily  into  their  "proper"  categories.  And  identical 
statements made in different contexts often fall into different
categories. This means that the data generated by the system consist of
interpreted behaviors of group members. And it may be worth noting
that the interpretation must be done under unusually pressing
circumstances, that is, during the group's interaction when rapid-fire
exchanges  are  taking  place  among  several  group  members  and  while  each 
utterance  is  being  colored  by  tone  of  voice,  facial  expression,  and 
gestures.

At one level this gives rise to a problem of reliability
between observers, for the difficulty of handling the category system
under actual working conditions can foster unreliability. At a more
basic level, however, it is a problem of validity, because all of the
data must contain the systematic biases and errors introduced in the
process of interpretation, even when multiple observers agree on the
interpretation applied. And once the data have been recorded in the
IPA format, there is no way of separating this methodological bias from
the actual verbal and physical behaviors of the persons observed.

Other  Methods 

Webb, et al. (1966) present a wide variety of classes of
measures which might be used to supplement self-report methodologies
and correct for systematic biases. While the methods suggested by
Webb, et al. may be more heuristic than elegant for most researchers,
there are promising, unobtrusive, non-self-report methodologies which
have  been  made  available  in  recent  years.  Argyle  and  Kendon  (1967) 
have  reviewed  a  number  of  these  techniques,  some  of  which  have  been 
pursued  in  detail.  The  line-of-regard  measure  developed  by  R.  Sommer 
(1959,  1961,  1962,  1965)  in  his  research  on  interpersonal  space  and 
small  group  ecology  is  particularly  noteworthy.  Sommer  has  found  that 
people adopt different seating patterns according to the type of
interaction in which they are engaged. Seating arrangement, he suggests,
is  a  major  determinant  of  group  process  because  of  the  way  in  which  it 
facilitates  or  inhibits  visual  contact  between  people. 

Summary  and  Implications 

All methods of measurement contain error and each method tends
to contain certain types of errors more than others. Furthermore, the
error tends to be systematic (i.e., correlated) over different traits
measured by any given methodology. The tragedy is not, however, that the
available methodologies contain systematic errors. It is, rather, that
there has been a strong tendency to use a single methodology exclusively
in each research project. As Webb, et al. (1966) lament, there is an
overdependence on a single, fallible methodology.

Any measurement taken on group activities may be partitioned
into a component which reflects the trait purportedly measured and an
error component. These have elsewhere been labeled "relevant" and
"irrelevant" components (Campbell, 1960; Campbell and Fiske, 1959;
Garner, 1954; Garner, Hake, and Eriksen, 1956). The error may be
further partitioned into a number of subsections, including at least
one segment of error which is peculiar to the method of measurement
employed. Thus, if several methods were used to measure a particular
trait, they would be expected to share a common relevant component
and to differ in their patterns of irrelevant components. The implication
is that each method of measurement should be checked against other methods
having different patterns of irrelevant components (i.e., different
sources of error).
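
The partition described above can be illustrated with a small
simulation. What follows is a minimal sketch under stated assumptions,
not a model taken from the source: each score is assumed to be the
simple sum of a trait (relevant) component, a method-specific error
shared by all traits measured with that method, and independent random
error. The variances, the two traits, the two methods, and all names
are hypothetical.

```python
import random


def pearson(xs, ys):
    """Plain Pearson product-moment correlation."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def simulate(n_subjects=2000, seed=0):
    """Return scores[(trait, method)] as a list with one score per subject.

    Assumed model: score = trait component + method error + random error.
    """
    rng = random.Random(seed)
    scores = {(t, m): [] for t in "AB" for m in (1, 2)}
    for _ in range(n_subjects):
        trait = {t: rng.gauss(0, 1) for t in "AB"}     # relevant components
        method = {m: rng.gauss(0, 1) for m in (1, 2)}  # method-specific error
        for (t, m) in scores:
            noise = rng.gauss(0, 0.5)                  # unsystematic error
            scores[(t, m)].append(trait[t] + method[m] + noise)
    return scores


scores = simulate()
same_trait_diff_method = pearson(scores[("A", 1)], scores[("A", 2)])
diff_trait_same_method = pearson(scores[("A", 1)], scores[("B", 1)])
diff_trait_diff_method = pearson(scores[("A", 1)], scores[("B", 2)])

print(round(same_trait_diff_method, 2))
print(round(diff_trait_same_method, 2))
print(round(diff_trait_diff_method, 2))
```

Under these assumed variances the population correlation is
1 / (1 + 1 + 0.25), or about .44, both for one trait measured by two
methods and for two unrelated traits measured by one method, and zero
for unrelated traits measured by different methods. A researcher using
a single method would therefore see a substantial correlation between
two traits that are in fact independent, which is why each method must
be checked against others with different sources of error.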

For the most part, we have little information about the convergence
of commonly-used measures of group processes. But in at least one case,
some limited information is available. Hase and Goldberg (1967)
compared four strategies of personality testing against two control
strategies. Since the Hase and Goldberg study was confined to the general
methodology of personality testing, thereby retaining the common systematic
error due to this general methodology, one would have expected the validity
coefficients to be relatively high. Unfortunately, it was found that even
within the general methodology of formal personality testing, none of the
four strategies (factor analytic, contrasted groups, rational, and
theoretical test-construction) correlated particularly well with the
criterion when cross-validated, although all strategies did hold slight
advantages over the control (stylistic and random test-construction)
strategies. One implication of this study is that all strategies of
testing introduce error variances, leaving the experimenter to choose
the type of error with which he is most comfortable.

There are some more general implications, however, which follow
from  this  brief  review  of  commonly-used  methodologies.  One  implication 
is  that  multiple  measures,  subject  to  different  sources  of  error,  must 
be used in conjunction with each other more frequently. A second
implication is that there is a general need to collect a body of systematic
data,  employing  these  commonly-used  measurement  methodologies,  to  enable 
investigation  of  convergence  and  differentiations  of  the  measures  with 
respect  to  each  other.  A  third  implication  is  that  observational  systems 
require developing in the direction of unobtrusive, objective (i.e.,
low inference) measurements. The strategy of the present study attempted
to  heed  these  implications. 

The Study of Groups Through Time

It has long been recognized that the behaviors of group members
change as time passes and the group establishes communication patterns,
leadership roles, etc. The idea that group behaviors change as the
group works through various phases in the process of solving problems
goes back at least as far as John Dewey (1910), and has been studied
more recently by Bales and Strodtbeck (1964). In order to identify the
nature and pattern of these changes, groups must be studied through a
period of time.

A number of researchers have called for studies in which a
series of time-ordered observations is made (Appley and Trumbull, 1967;
Firth, 1963; Leach, 1954; McGrath and Altman, 1966; Nadel, 1964;
Smith, 1966). But in spite of the widely-recognized need, relatively
few studies through time have been conducted, especially under
well-controlled experimental conditions (Tuckman, 1965). This has resulted
in inadequate understanding of group processes through time.

Time-related research on groups may be divided into two
categories, depending upon the kinds of changing behavior patterns being
investigated: (a) patterns of activity in groups which change from
meeting to meeting, and (b) shifts or phase movements in activity
within a single meeting. Research in the first category has consisted
primarily of field studies, of which there have been only a few.
Research in the second category has received some attention in the
laboratory. Examples of each type will be cited briefly.

Shifts  Between  Sessions 

Long-standing  associations  and  long-duration  missions  have 
provided  some  opportunities  for  the  study  of  a  series  of  meetings  of 
a  single  group.  Newcomb  (1961),  in  his  well-known  study  of  college 
men  who  lived  together  during  the  school  year,  dealt  with  changes  in 
associations over time. His primary interest was in the formation and
change  of  friendship  relationships  among  his  subjects.  He  found  that 
friendship  patterns  among  his  subjects  tended  to  stabilize  during  the 
first  two  weeks  and  to  remain  relatively  unchanged  thereafter. 

Smith (1966) reports a study which he conducted while travelling
with a seven-man Antarctic exploration group through its entire (four
month) existence as it conducted geological studies on an ice field.
Smith's chief psychological interests were in group structure and the
effects of personal danger, boredom, and monotony. Regarding group
structure, he found that it took about a week for a work-structure to
develop. In about three and one-half weeks an interpersonal structure
developed. No structural changes were discovered in succeeding weeks.

Such  field  studies  are  especially  valuable  for  identifying  the 
important variables which may relate to group processes. Clearly, it
seems on the basis of these studies that some aspects of group behavior
do change over time. But further studies with stronger experimental
control  are  necessary  to  isolate  or  refine  such  variables. 

One developing body of research conducted by Haythorn, et al.
(Haythorn and Altman, 1967; Altman and Haythorn, 1965; Haythorn, Altman
and Myers, 1966) has brought relatively long-duration (continuous 10-day
observation) studies into the laboratory. The focus of Haythorn's
research is the isolation of small groups and his primary concern is
with the effects of isolation on behaviors of group members (e.g.,
self-disclosure). He has also investigated some effects of group composition
(combinations of subjects having certain personality traits like
dominance, dogmatism, etc.) on isolated groups. Unfortunately, this body
of research is limited to studies of two-man groups. Although these
studies are not conducted so as to assess the effects of time so
much as the effects of isolation, they do recognize the need for
psychological studies through time, and in part fill that need.

Somewhat related to the Haythorn studies is a body of research
on stress in long-duration military groups. Much of this research
has been conducted by Weybrew (1957, 1963). Related research has been
conducted by Faucett and Newman (1953), Levine (1958), and Ritch (1940).
An excellent review of these studies may be found in Weybrew (1967).
This body of research is primarily concerned with the stress of
long-term isolated missions aboard submarines, and the dependent variables
measured include measures such as alertness, critical flicker
frequency, and perceptual spans. So it is difficult to assess, from
these studies, the effects of the long-term missions on interaction,
dominance in the group, and other interpersonal variables of interest
in regard to group process.

R. F. Bales is one researcher who has attempted to study group
interaction and its changes between successive sessions under controlled
experimental conditions. Heinicke and Bales (1953) report an experiment
which observed initially leaderless groups over a series of four meetings.
Observation was by the Bales category system. It was found that members
spent less time in task behavior and more in positive socio-emotional
behavior as the series of meetings progressed. Negative socio-emotional
behavior rose briefly during the second meeting, in what was labeled the
"status struggle." A similar pattern was noted in a longer series of
meetings  in  a  study  by  Theodorson  (1953).  However,  groups  which  met 
for  only  one  session  did  not  encounter  the  "status  struggle"  which 
typically  took  place  during  the  second  session  of  longer-duration  groups. 

In  a  study  of  ten  5-man  groups  which  met  for  five  consecutive 
weekly  sessions,  Deutsch  (1949)  found  several  significant  differences 
between  cooperative  and  competitive  groups.  Cooperative  groups  worked 
faster, were better coordinated, and had fewer communication difficulties.
However,  no  striking  developmental  differences  over  time  were  found. 

In summary, the research on group processes through time is
relatively sparse and scattered. Some studies of long-duration missions
tend to lack experimental control (as in the Antarctic studies).
Others have not been primarily concerned with the effects of time or
with the psychological dependent variables which would aid in
understanding group process (studies of isolation and submarine studies).
The work of Bales, et al. has shown that significant changes in group
behavior may occur between sessions, although Deutsch failed to find
between-session changes.

Phases  Within  Sessions 

There has been some research on changes in group behaviors within
a single meeting or session, although this subject area has barely been
tapped. Bales has probably made as large a contribution here as any
other researcher. In a large number of studies, Bales has identified
three stages or phases (Orientation, Evaluation, and Control) through
which problem-solving groups move within a single session (Bales,
1950a, 1950b; Bales and Gerbrands, 1948; Bales and Strodtbeck, 1964).
Other research on within-session phases of activity has been relatively
sparse.  Tuckman  (1965)  has  reviewed  and  integrated  some  50  studies 
dealing  with  stages  of  group  development  over  time,  and  has  proposed  a 
conceptual  model  for  understanding  observed  stages.  There  are  studies  of 
therapy  groups  and  their  phases,  but  these  seem  to  be  unique  to  therapy 
groups  and  not  easily  generalizable  to  groups  of  other  types  (Talland, 
1955). 

Thus,  some  significant  changes  in  group  behavior  which  need 
further  investigation  seem  to  take  place  through  time.  Studies  which 
consider  only  a  single  session  of  a  group  (whether  that  session  be  the 
first, last, or an intermediate one in a series), may be overlooking
important changes in group behavior. Such studies also run the risk of
projecting their empirical findings to groups "in general," which may
be  an  unwarranted  generalization.  The  implication  which  follows  from 
this  survey  of  research  is  that  there  is  a  strong  need  to  collect  a 
systematic body of data on group behaviors through time.

Campbell and Fiske's Multitrait-Multimethod Paradigm

Campbell and Fiske (1959) have detailed an approach to
experimentation which allows E to separate trait variance from method variance
and thereby to establish both convergent and discriminant validity for
his methods. Briefly described, the Campbell and Fiske approach
consists of measuring each of several traits by each of several methods.
If the traits measured are functional unities and if the measurement
methods are valid, the various methods should correlate well (converge)
when measuring the same trait. The methods should correlate poorly
(discriminate), both within a method and between methods, when measuring
different traits.

The approach is illustrated in Table 1. The illustration
involves three different traits, each measured by three different
methods. The x-values in the matrix represent reliabilities. These
could also be spoken of as monotrait-monomethod values. Campbell and
Fiske intend test-retest reliabilities to be entered here, although
there are other kinds of reliability which could be used. The y-values
represent convergent validity, that is, the extent of agreement between
two methods when used to measure the same trait. These may also be
spoken of as monotrait-heteromethod values. High correlations
indicate that two methods attempting to measure the same trait are
in agreement, thus lending credence to the belief that both are
actually measuring the same trait.

The z-values are heterotrait-monomethod correlations, which
represent method variance. When different traits are being measured,
high heterotrait-monomethod values are a sign that a particular method
agrees with itself regardless of the traits it is being used to
measure. Low values, on the other hand, indicate that different traits
are measured differently within the method. Thus, low values are desired
here, which, in combination with high monotrait-heteromethod values,
would indicate discriminant validity.


TABLE 1

A Synthetic Multitrait-Multimethod Matrix

                  Method 1        Method 2        Method 3
                 A1  B1  C1      A2  B2  C2      A3  B3  C3

Method 1    A1   x
            B1   z   x
            C1   z   z   x

Method 2    A2   y   w   w       x
            B2   w   y   w       z   x
            C2   w   w   y       z   z   x

Method 3    A3   y   w   w       y   w   w       x
            B3   w   y   w       w   y   w       z   x
            C3   w   w   y       w   w   y       z   z   x

The w-values are heterotrait-heteromethod values. These are
correlations between different traits as they are measured by different
methods.  Provided  that  reasonably  valid  measurement  methods  are  being 
used,  these  values  will  indicate  the  true  relationships  among  the 
traits. 

The  example  in  Table  1  could  be  extended  to  include  as  many 
traits  or  methods  as  one  might  desire.  (The  present  research  measured 
each  of  three  traits  by  each  of  five  methods.)  It  is  important  to 
consider  the  values  in  the  multitrait-multimethod  matrix  in  the  order 
presented here. Since the validity of any measurement method is limited
by its reliability, the reliabilities of the methods (x-values in Table 1)
must be considered first. The other (y, z, and w) values must be
considered in relationship to each other and, strictly speaking, need
not be ordered. But it is convenient to look for the possibility of
convergent validity early. If convergence among methods does not appear
to be present (on the basis of the y-values), further consideration of
the matrix is of little value. Given convergence, however, one may
inspect the z-values for discrimination between traits.

As a final test of validity, the pattern of the w-values
should be noted. Given reliability and validity of measurements, the
w-values should represent the true relationships between pairs of
traits. Since the underlying traits bear the same relationship to each
other regardless of the measurement methods employed, the pattern of
w-values within a heterotrait-heteromethod triangle should be the
same, regardless of the heterotrait-heteromethod triangle being
considered.
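
The four kinds of matrix entries described above can be made concrete
with a short sketch. The Python below is an illustration only and is not
a procedure from the present study: the names classify_cell and
summarize_mtmm and the toy correlations are hypothetical, and averaging
the values within each class is an assumption made for brevity
(Campbell and Fiske compare individual values rather than means).

```python
from statistics import mean


def classify_cell(trait_a, method_a, trait_b, method_b):
    """Return the Table 1 letter for the correlation between two measures."""
    same_trait = trait_a == trait_b
    same_method = method_a == method_b
    if same_trait and same_method:
        return "x"  # monotrait-monomethod: reliability
    if same_trait:
        return "y"  # monotrait-heteromethod: convergent validity
    if same_method:
        return "z"  # heterotrait-monomethod: method variance
    return "w"      # heterotrait-heteromethod


def summarize_mtmm(correlations):
    """Average the correlations within each class of matrix entry.

    `correlations` maps a pair of (trait, method) measures to r.
    Classes with no entries are left out of the result.
    """
    groups = {"x": [], "y": [], "z": [], "w": []}
    for (a, b), r in correlations.items():
        groups[classify_cell(a[0], a[1], b[0], b[1])].append(r)
    return {k: mean(v) for k, v in groups.items() if v}


# Toy example: two traits by two methods, with made-up correlations.
toy_r = {
    (("influence", "self"), ("affect", "self")): 0.60,     # z
    (("influence", "peer"), ("affect", "peer")): 0.50,     # z
    (("influence", "self"), ("influence", "peer")): 0.40,  # y
    (("affect", "self"), ("affect", "peer")): 0.30,        # y
    (("influence", "self"), ("affect", "peer")): 0.20,     # w
    (("affect", "self"), ("influence", "peer")): 0.10,     # w
}
summary = summarize_mtmm(toy_r)
```

In this toy matrix the z-values exceed the y-values, which is the
pattern that would signal method variance rather than discriminant
validity.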
Second, there must be a set of methods which can be implemented
simultaneously, or nearly so, within the situation in which the traits
appear. The methods must be subject to different sources of error
variance (see Webb et al., 1966). Ideally, of course, the methods
would be completely independent of each other and therefore orthogonal
with respect to error variance (see Garner, Hake and Eriksen, 1956).

Third, there must be a set of time-ordered occasions on which
the measurement methods can be applied to the traits. These
time-ordered occasions must be comparable to each other, so that the
measurements may be meaningfully related with one another.

Hackman's Standardized Laboratory Tasks

The requirement of some basis for comparability among occasions
in a time-ordered series has been a particularly difficult one for
social-psychological research because of the lack of a set of standardized
tasks or activities. Unless tasks are appropriately held constant,
counterbalanced, or sampled throughout an experimental design, a real
possibility exists that the results of the study may be seriously
confounded with unintended task effects. Task type has proved to be
a large factor in a number of studies. Hackman found that
task type controlled well over 50% of the variance among group products
in some dimensions.

Holding  tasks  constant  over  a  series  of  occasions  may  place 
severe  (though  sometimes  hazy)  restrictions  on  the  generalizability 
of  results.  And  the  repetition  of  the  same  task  over  occasions  allows 
Ss  to  practice  the  tasks,  thereby  confounding  practice  effects  with 
the  effects  of  time.  On  the  other  hand,  counterbalancing  and  sampling 
of task types encounter other problems: (1) experimental manipulations
are easily confounded with task variance and (2) the relevant parameters
of available tasks, necessary for appropriate counterbalancing and sampling,
may not be known.

Fortunately, some recent research by Hackman (1966) has identified
certain major parameters of written intellective tasks (task type and
difficulty) and has provided a set of 108 tasks, balanced in terms
of these parameters, which are appropriate for small-group experimental
sessions. Hackman has also provided a set of scales along which products
generated from these tasks may be evaluated.

The three task types isolated by Hackman are: production,
discussion, and problem-solving. Production tasks require Ss to generate
ideas about some topic. Discussion tasks require the selection of a
"solution" from among a set of possibilities. Problem-solving tasks
require Ss to decide how a particular solution can be executed. Each
task requires about 15 minutes working time and each requires a written
product. The tasks are graded into levels of difficulty within type.
Normative data are available for small groups from size two to size
seven, although the bulk of the data have been gathered from 3-man
groups.

The availability of this set of standardized tasks has made it
possible to use the multitrait-multimethod approach of Campbell and
Fiske, testing each group over a series of occasions on different
(but comparable) tasks. Thus, it has also become possible to manipulate
situational factors during experimental sessions, since these
manipulations may be kept orthogonal to the basic task parameters.

General Plan of the Study

The implications of the foregoing sections are for collection
of a systematic body of data, having certain features, as follows:
(1) The data must involve the measurement of a set of traits or behaviors
by each of a set of methods. (2) The traits must be sufficiently distinct
from each other to allow the methods to discriminate between them.
(3) The methods must be sufficiently independent to make them subject
to different patterns of error variance. (4) The body of data must respect
distinctions between types of group tasks. (5) Observations must be
taken through time so that the effects of time may be noted. The following
arrangement of experimental conditions was designed to produce a body of
data conforming to these specifications.

Traits  Measured 

In  a  methodological  study  such  as  this,  any  of  several  sets  of 
traits  could  have  been  selected.  Those  chosen  for  this  particular 
study have been selected because they seemed fundamental to interpersonal
process.  Examination  of  a  range  of  empirical  and  theoretical  concepts 
suggests  that  three  processes,  influence,  affect,  and  activity,  are 
inherent  in  any  interpersonal  situation.  The  effects  of  many  important 
member,  group,  and  situational  variables  seem  to  be  mediated  through 
one  or  more  of  these  three  fundamental  interpersonal  processes 
(McGrath,  1963). 

One source which identifies these three basic parameters is
the research of Schutz (1958, 1961), who asserts that there are three
basic interpersonal needs: inclusion, control, and affection. The
inclusion dimension corresponds in part to activity. Control
corresponds well to influence. And the affection dimension is self-evident.

Another source which identifies these three basic parameters,
in slightly different, but parallel, terminology, is Osgood et al.
(1957). He has repeatedly found activity, potency, and evaluation to
be dimensions of semantic space. While the contents of Osgood's and
Schutz' work are quite different, there are marked similarities between
matched pairs of concepts. In an interpersonal situation Schutz'
"inclusion" and Osgood's "activity" are both closely related to
participation, or amount of interaction. Similarly, "control" and
"potency" are both intimately tied to influence. And, in an
interpersonal situation, Osgood's "evaluation" and Schutz' "affection"
both have to do with the same parameter of valence or attraction
between persons.

Hemphill  (Hemphill  and  Westie,  1950)  devised  two  dimensions  of 
leader-member  relations  (initiation  of  structures,  and  consideration) 
which  are  similar  to  influence  and  affect.  Other  researchers  who  have 
found strikingly similar dimensions of interpersonal relations are
Borgatta,  Cottrell,  and  Mann  (1958),  Bales  and  Slater  (1955),  Carter 
(1951),  Leary  (1957),  and  Schaefer  (1960). 

These three dimensions also reflect the four primary dimensions
of group structure as formulated by Cartwright and Zander (1960).
Their concepts of the communication structure of a group, and its task
structure or division of labor, both refer to patterns of interaction,
with communication and task activity being specific forms of interaction.
Their term "power structure" refers to the pattern of interpersonal
influence. Their "sociometric" or "friendship structure" refers to
the patterns of interpersonal affect or attraction.

One may reasonably conclude that the three parameters, influence,
affect, and activity, are basic dimensions of any interpersonal situation.
The present research attempted to measure these three parameters as
the "behavior traits" of the study.

Methods  of  Measurement 

The important feature of the methods of measurement used is that
they differ in the sources of error variance to which they are subject.
If error variance is confounded with trait variance, convergence
between methods could be due either to functional unity of the trait
being measured or to error. But if the error variance in two (or more)
methods differs, convergence between methods can more reasonably be
attributed to functional unity of the trait being measured. Thus, a
set of methods which differ in their sources of error must be chosen.

Five different methods of measurement were selected for this
study: self reports, peer ratings, observer ratings, personality
testing, and event records. The first three might be grouped together
as a single "rating" methodology. The implementation of these
methodologies is detailed in Chapter II.

Occasions

The availability of Hackman's tasks allowed the present research
to use the multitrait-multimethod approach of Campbell and Fiske,
testing each group over a series of occasions on tasks which were
different but comparable in type and difficulty. These standardized
batteries of tasks also allowed the manipulation of a situational
factor (stress) in a way that avoided confounding task variance with
treatment variance.

The experimental groups met for four sessions of two hours each
during which they worked on Hackman's tasks. The first hour followed
the same procedure in all four sessions: it consisted of a normal,
comfortable work load for the group. The second hour of each session
varied from session to session. For the second hour of the first and
third sessions the groups were subjected to "stress" in the form of a
work overload. The second hour of the second and fourth sessions
involved a "stressful" underload. Procedures for imposing the stress
upon the groups are described in Chapter II. The second and fourth
sessions were shortened slightly to allow for administration of the
Interpersonal Adjective Checklist (session two) and for de-briefing of
the Ss at the end of the study (session four).

The  experimental  design  is  pictured  in  Figures  1  and  2. 

Figure  1  pictures  the  multitrait-multimethod  matrix  on  the  front  "face" 
with occasions pictured in the third dimension. Figure 2 is a detailed
picture  of  the  experimental  situations.  Each  of  these  figures  exists 
conceptually  in  triplicate,  one  of  each  for  each  task  type. 


Figure 1. Over-all Design

Figure 2. Experimental Sessions


CHAPTER  II:  EXPERIMENTAL  PROCEDURES 


Subjects  and  Data-Collection 

The  Ss  were  64  male  students  in  Psychology  100  at  the 
University  of  Illinois.  Ss  were  recruited  for  this  study  as  they 
finished  participation  in  Lowe's  experiment.*  Participation  in 
Lowe's  experiment  was  a  course  requirement,  while  participation  in 
the present study was voluntary. Ss were offered $1.50 per hour
for  eight  hours  of  work  to  participate  in  the  present  study. 

Ss were divided among eighteen 3- and 4-man groups. Five
groups  (three  4-man  and  two  3-man)  worked  on  production  tasks.  Six 
groups  (two  4-man  and  four  3-man)  worked  on  discussion  tasks.  And 
seven  groups  (five  4-man  and  two  3-man)  worked  on  problem-solving 
tasks.  The  assignment  of  task  type  was  based  on  the  Ss'  experiences 
in  Lowe's  study.  Ss  who  had  worked  on  production  tasks  in  her  study 
were  assigned  to  production  tasks  in  the  present  study,  etc. 
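
As a quick consistency check, the group counts given above can be
tallied against the totals of eighteen groups and 64 Ss. The sketch
below only restates the numbers in the text; the dictionary layout and
variable names are illustrative.

```python
# (number of 4-man groups, number of 3-man groups) per task type, from the text
groups = {
    "production": (3, 2),
    "discussion": (2, 4),
    "problem-solving": (5, 2),
}

total_groups = sum(four + three for four, three in groups.values())
total_subjects = sum(4 * four + 3 * three for four, three in groups.values())

print(total_groups)    # 18 groups
print(total_subjects)  # 64 subjects
```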

One  of  the  4-man  discussion  groups  and  one  of  the  4-man 
problem-solving  groups  lost  a  member  after  the  initial  session. 

One of the 4-man discussion groups and one of the 4-man
problem-solving groups lost a member after session III. All other groups
maintained their initial membership throughout the experiment.

*See appended note for the relationship of the present study
to that of Rosemary Lowe.

One of the 3-man discussion groups met for three sessions of two hours
each. All other groups met for four sessions of two hours each. The
membership of the groups participating in the study is presented
fully in Table 2.*

In general, the experimental sessions were held weekly and the
four sessions were completed in four consecutive weeks. In some
cases, however, adjustments in the schedule were made to accommodate
Ss who had tests or were ill, and to allow for certain holidays and
vacations.

During the experimental sessions Ss sat at a rectangular table
located at one end of a long, narrow room. Two members sat along one
side and one at each end (4-man groups). Ss in 3-man groups were free
to choose any three of the four chairs, with most choosing the two end
chairs and one along the back side.

An 8 mm movie camera was located at the other end of the
room, and the experimenter sat at a table beside the camera. The
camera was set on single frame mode and positioned so that the entire
group appeared in the photographic field. The cameraman was signalled
to photograph by a pair of Hunter decade timers, wired and adjusted
to signal a five-second observation period at the beginning of each
minute. Three photographs were taken during this observation period,
spread evenly over the five seconds. While photographing, the cameraman
noted which member(s) of the group was (were) talking. The instructions
and record-keeping form for the cameraman may be found in the appendix.

At the beginning of each session each group was instructed to
work at a normal, comfortable pace. The instructions were:

During this first hour today we would like you to work
at a normal, comfortable pace. The tasks are designed
to take about 15 minutes each. So, you might expect to
complete about three tasks during the first hour. But
they do vary somewhat and different groups work at
different speeds.

Some groups work a little faster than others and some
are more deliberate. So you might complete only one
or two tasks, or you might finish four. In any case,
work at a pace that is comfortable for this group.

Signal me when you have finished a task and I will
have some questionnaires for you to fill out.

Here is your first task. You may begin.

*See appended note for a discussion of 3- vs. 4-man groups in
this study.

At the completion of each task during this hour the experimenter
administered the self-rating/peer-rating form to members of
the  group  (appended  form  S»2-g).  Data  from  this  form  were  used  to 
generate  self-rating  and  peer-rating  scores  of  influence,  affect,  and 
activity for each S. While Ss were completing the rating forms, the
experimenter  was  rating  all  members  of  the  group  on  the  observer  rating 
form  (appended  form  2-g).  Data  from  this  form  were  used  to  generate 
observer ratings of influence, affect, and activity for each S.

Questions  concerning  motivation  and  coordination  in  the  group  were  also 
included  on  the  rating  forms. 

As  soon  as  the  rating  forms  were  complete  and  collected,  the 
group  was  instructed  to  begin  the  next  task.  A  five-minute  "break" 
was  taken  at  the  end  of  the  first  hour.  The  first  hour  of  Session  II 
was  shortened  by  about  five  minutes  in  order  to  allow  time  for  Ss  to 
take  the  Interpersonal  Adjective  Checklist  (Laforge,  1963;  Leary,  1957). 

The  second  hour  of  Sessions  I  and  III  was  designed  to  produce 
an  overload  situation  for  the  groups.  They  were  told  that  information 
was  needed  about  the  maximum  work  load  that  groups  could  handle  and 
therefore  they  should  work  as  rapidly  as  possible.  The  instructions 
were:

We need some information about the maximum work load which
groups like this can handle. We would therefore like you
to work as rapidly as possible without sacrificing the quality
of your products. Keep the quality comparable to what you
produced last hour.

As an added incentive, we are offering a $5.00 per man
bonus for the group which has the best over-all performance
in this phase of the experiment. Your score from this
hour will be combined with your scores from certain other
hours in order to determine which group wins. For this
particular hour it is speed that counts: the number of
adequate products written. Here are the tasks for this
hour.  Do  them  in  this  order.  When  you  finish  one  task, 
go  right  on  to  the  next  one.  I  will  interrupt  you  at 
15-minute  intervals  for  rating  questionnaires.  You  may 
begin. 

The second hour of Sessions II and IV was designed to produce
an underload condition. The instructions for these hours were:

We  need  some  information  about  the  maximum  quality  which 
can  be  produced  by  groups  of  this  type.  So  for  this  hour 
we would like you to write as complete and high quality
a  product  as  you  can.  Speed  is  of  no  importance  here. 

You  may  well  spend  the  full  hour  on  one  task,  although 
there  is  a  second  task  available  if  the  first  one  does  not 
completely  fill  the  hour.  I  will  interrupt  you  at  15-minute 
intervals  for  ratings.  The  $5.00  per  man  bonus  applies  to 
this  hour.  Here  are  the  tasks.  You  may  begin. 

The subjects' and observers' rating forms used at 15-minute
intervals during the second hour were identical to those used for
the first hour.

The tasks used were those provided by Hackman (1966). In
all, there were 108 different tasks, 36 in each of the three task
types. A given group worked on tasks of only one type. The tasks
in  each  type  were  randomly  sequenced  prior  to  the  experiment.  All 
groups  within  a  task  type  experienced  an  identical  sequence  of  these 
tasks. 


Four tasks were provided for each of the normal load hours.
Eight tasks were provided for each of the overload hours. And two tasks
were available for each of the underload hours.
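
The task allotment can be checked against the session design described
earlier (a normal-load first hour in every session, overload in the
second hour of Sessions I and III, underload in the second hour of
Sessions II and IV). The following sketch is illustrative only; the
names TASKS_PER_HOUR and SESSIONS are hypothetical.

```python
# Tasks provided per hour under each load condition (from the text).
TASKS_PER_HOUR = {"normal": 4, "overload": 8, "underload": 2}

# Load condition for (first hour, second hour) of Sessions I-IV.
SESSIONS = [
    ("normal", "overload"),   # Session I
    ("normal", "underload"),  # Session II
    ("normal", "overload"),   # Session III
    ("normal", "underload"),  # Session IV
]

tasks_per_group = sum(
    TASKS_PER_HOUR[load] for session in SESSIONS for load in session
)
print(tasks_per_group)  # 36
```

The total of 36 agrees with the 36 tasks available within each task
type, so a group that completed every task provided would exhaust the
tasks of its type.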

The data resulting from these procedures were: (1) up to
28 post-task questionnaires from each S and observer, (2) an
Interpersonal Adjective Checklist from each S, (3) a series of photographs
(of each group) which systematically time-sampled the task activities,
and (4) up to 36 written products from each group.

Data  Handling  and  Scoring 


Post-Task Ratings

Each  item  on  the  questionnaires  was  scored  on  a  seven-point 
scale,  with  seven  representing  the  "high"  end  of  the  scale  and  one 
representing  the  "low"  end  of  the  scale.  Influence  was  scored  as  the 
average  of  questions  two  and  five.  Affect  was  scored  as  the  average  of 
questions  six  and  seven.  And  activity  was  scored  as  the  average  of 
questions  two  and  three.  Thus,  each  of  the  variables  could  range  from 
one  to  seven  for  self,  peer,  and  observer  ratings.  Questions  one, 
four,  eight,  and  nine  were  included  so  that  there  would  be  systematic 
relationships  between  Lowe's  study  and  the  present  study.  These 
questions  were  not  scored  for  the  present  study. 
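
The scoring rule just described can be sketched in a few lines. The
Python below is a minimal illustration, assuming nine questions numbered
as in the text and answered on the seven-point scale; the names
TRAIT_ITEMS and score_questionnaire and the example responses are
hypothetical.

```python
from statistics import mean

# Question numbers averaged for each trait, as stated in the text.
TRAIT_ITEMS = {
    "influence": (2, 5),
    "affect": (6, 7),
    "activity": (2, 3),
}


def score_questionnaire(responses):
    """Score one post-task questionnaire.

    `responses` maps question number (1-9) to a rating from 1 to 7.
    Questions 1, 4, 8, and 9 are ignored, as in the present study.
    """
    scored_items = {q for items in TRAIT_ITEMS.values() for q in items}
    for q in scored_items:
        if not 1 <= responses[q] <= 7:
            raise ValueError(f"question {q} is outside the 1-7 scale")
    return {
        trait: mean(responses[q] for q in items)
        for trait, items in TRAIT_ITEMS.items()
    }


# Made-up responses to the nine questions.
example = {1: 4, 2: 6, 3: 4, 4: 2, 5: 5, 6: 3, 7: 4, 8: 7, 9: 1}
scores = score_questionnaire(example)
# influence = (6 + 5) / 2 = 5.5
# affect    = (3 + 4) / 2 = 3.5
# activity  = (6 + 4) / 2 = 5
```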

The Interpersonal Adjective Checklists (ICL's)

The ICL's were scored in the standard way (Leary, 1957) in
order to generate measures of influence (Leary's Dom-Sub scale) and
affect (Leary's Love-Hate scale). In addition to this standard scoring,
the total number of items checked by S on the ICL was noted as an
indicator of the S's activity level.

Event  Records 

The development of a method for noting and scoring behaviors
of group members ("event records") may be one of the most useful results
of the present research.

"Event records" were taken by means of photographing the group,
on a time-sample basis, throughout the experimental sessions. These
photographic records are supplemented only by the notation of the cameraman
as to who was talking at the time he took the pictures. Scores of
influence, affect, and activity were taken from the photographs as
follows:

(a) Influence Given. A person influences others if he talks,
if he is looked at by another person, and additionally if he is
talking and is looking reciprocally at another person. Thus, a S
received one "Influence Given" point for each time he talked,
multiplied by the number of people to whom he talked (n-1). He also
received an "Influence Given" point for being looked at by another
S, and an additional point if, while he was talking, he and another
S looked at each other reciprocally. These scores could vary from zero
to nine.

(b) Influence Received. A person receives influence if he is
talked to, if he looks at another person, and additionally if he looks
reciprocally at another person who is talking. Thus, a person received
an "Influence Received" point for each time he was talked to, one point
for each person he looked at, and one additional point if he and a
speaker were looking at each other. Influence received scores could
vary from zero to nine.

(c) Positive Affect Given. A person exhibits positive affect
when he smiles or laughs. Even if no one sees his smile, he is
generating positive affect and "giving" it, as it were, to himself.
If someone sees the smile (i.e., is looking at the smile), the positive
affect is given to the person who sees it. Additionally, if the smiling
person looks reciprocally at another person the positive affect is more
strongly given. Thus, a subject received a "Positive Affect Given"
point for each time he smiled, another point for each S who looked
at the smile, and one additional point if the subject was smiling and
looking reciprocally at another S.

(d) Positive Affect Received. A person who smiles indicates
that he is receiving the positive affect which he himself generates.
A person who sees another person smile receives the positive affect
from that other person. And there is additional positive affect received
when a person looks reciprocally at another who is smiling. Thus a
person received one "Positive Affect Received" point if he smiled,
another if he watched someone else smile, and an additional point if
he was looking reciprocally at another person who smiled.

(e) and (f) The logic and scoring of the Negative Affect Given
and Negative Affect Received parallel that for positive affect. The
only difference is the nature of the facial expression involved. Each
affect score could range from zero to seven.

(g) and (h) Activity levels. Several behaviors were classed
as activities, some of which overlap with the influence behaviors.
The following behaviors received one point each as indicators of activity:
talking (number of times), writing, moving of hands/arms, moving of
body (posture or body orientation), and movement of head. For each
person, two activity scores were computed (for each minute), a total
activity score (as above) and a score of activity without talking. These
scores could range from zero to seven.

This scoring of the data produced eight scores for each S
for each minute of his group's activity. Thus, a serial, minute-by-
minute record exists for each group on each of the three variables,
influence, affect, and activity. The scores were appropriately
adjusted to account for the numerical difference between 3- and 4-man
groups.* Sums of these minute-by-minute scores were combined for
certain purposes of analysis, as described below.

For purposes of multimethod comparisons certain other scores
were computed from the eight raw event records scores. Net influence
was computed as influence given less influence received. These scores
could range from nine to minus nine. Net affect was computed as
positive affect given, less negative affect given (range: seven to
minus seven). The activity score used in multimethod comparisons
was total activity.
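
The derivation of the three multimethod scores from the eight raw
event-record scores can be sketched as follows. This is a minimal
illustration under stated assumptions: the class name MinuteRecord, its
field names, and the example values are hypothetical, and the sketch
starts from raw scores that have already been tallied from the
photographs rather than reproducing that tallying.

```python
from dataclasses import dataclass


@dataclass
class MinuteRecord:
    """The eight raw event-record scores for one S in one minute."""
    influence_given: int           # 0 to 9
    influence_received: int        # 0 to 9
    pos_affect_given: int          # 0 to 7
    pos_affect_received: int       # 0 to 7
    neg_affect_given: int          # 0 to 7
    neg_affect_received: int       # 0 to 7
    total_activity: int            # 0 to 7
    activity_without_talking: int  # 0 to 7

    def multimethod_scores(self):
        """Derived scores used in the multimethod comparisons."""
        return {
            # Net influence: given less received (9 to -9).
            "influence": self.influence_given - self.influence_received,
            # Net affect: positive given less negative given (7 to -7).
            "affect": self.pos_affect_given - self.neg_affect_given,
            # Activity: the total activity score, used as is.
            "activity": self.total_activity,
        }


# Made-up raw scores for one S in one minute.
record = MinuteRecord(6, 2, 3, 1, 1, 0, 4, 3)
print(record.multimethod_scores())
# {'influence': 4, 'affect': 2, 'activity': 4}
```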

It will be noted that the scoring system detailed above
differs substantially from the observation systems previously used
in small group research. The above method is an objective system for
noting observable behaviors and for scoring those behaviors as
indicators of useful psychological concepts. Other systems of group
observation require experimenters to interpret the behaviors as they
observe them. The present system is considerably less inferential
and offers the possibility of much more reliable measurement than other
available group process analyses.

Products 

Each hand-written product was typewritten and then rated on
each of 20 scales by each of three independent judges (after
Hackman, 1966). The 20 scales represent six task-independent product
dimensions (three rating scales each) and two task-dependent product
dimensions (one rating scale each). A copy of the form on which the
products were typed and scored may be found in the appendix.

*See appended note for a discussion of 3- vs. 4-man groups
in this study.

The products were divided into two sets of about 300 products
each. One set consisted of all products produced by the first nine
experimental groups (three production type, three discussion, and
three problem-solving), plus 19 products from Lowe's study. The other
set consisted of all products from the other nine groups, and Bi
different products from Lowe's study.

The rationale behind including some duplicates of Lowe's products
in these product-sets was this: Lowe's product sets consisted of many
products, to each of only 12 tasks. This allowed for considerable
variety of response within tasks, but relatively little variety
between tasks. The sets of products generated by groups in the present
study consisted of few responses to each of 108 tasks. This allowed
for product sets with low variety of response within tasks, but high
variety between tasks. It was thought worthwhile, especially for Lowe's
research, to investigate whether or not these two different judging
contexts affected the judgments made by the raters.

Raters were trained for their work according to the procedures
developed by Hackman. Each judge was given a written description of
the scale on which he was to rate a set of products. When a judge
had read the description of the scale he sorted a set of sample
products, for which criterion values were available. He then compared
his judgments to the criterion judgments. Any discrepancies were
discussed with one of the experimenters or with a more experienced judge
until the discrepancies were satisfactorily resolved. Criterion values
were not available for the sample products on the task-dependent scales
so it was impossible to compare a judge's ratings with a criterion.
However, all judges who rated on Adequacy or Creativity discussed the
nature of the scale and several representative products with one of the
experimenters before proceeding to the ratings.

In all, 20 assistants were involved in the rating of products.
Some judges rated products on several different scales, within the
following limits: (1) No judge rated products on two scales in a single
dimension, (2) No judge rated products on different scales in a single
rating session, and (3) No judge rated products on any of the task-
independent scales after he had rated products on a task-dependent
scale.

In general, it took a rater two and one-half to three hours
to rate a set of products. In the cases of Adequacy and Creativity
it often took five to six hours for a judge to rate a single set of
products. This occasionally required that a judge take two sessions
on consecutive days to rate a set of products on one of these task-
dependent scales.

The three ratings given each product on each of the scales
were averaged for each product and added to the averaged ratings for
each of the two other highly correlated scales. This yielded eight
scores for each product, six task-independent scores and two task-
dependent scores. All scores could range from one to seven.
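
The scoring rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the scoring program used in the study: the function names and the example ratings are hypothetical, and because the text reports a final range of one to seven, the sketch assumes the combined scale averages were divided by the number of scales in the dimension.

```python
from statistics import mean


def scale_score(judge_ratings):
    """Average the ratings the judges gave one product on one scale."""
    return mean(judge_ratings)


def dimension_score(ratings_by_scale):
    """Combine the scale averages that belong to one product dimension.

    Assumption (not stated explicitly in the text): the scale averages
    are themselves averaged, so the result stays on the 1-7 range.
    A task-dependent dimension simply passes a single scale.
    """
    return mean(scale_score(ratings) for ratings in ratings_by_scale)


# Hypothetical ratings: three scales in one dimension, three judges each.
example_dimension = [[4, 5, 6], [3, 3, 3], [7, 7, 7]]
example_score = dimension_score(example_dimension)
```

A task-dependent dimension such as Adequacy would be scored by passing a single list of three judge ratings, for example `dimension_score([[2, 4, 6]])`.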


CHAPTER III


RESULTS


The amount of systematic data collected made a large number of
analyses possible. The complete analysis of these data is a task
which would consume several years. Many substantive, as well as
methodological, questions could be asked of these data. A complete
analysis, however, would obscure the major features of this research,
burying the primary findings beneath numerous interesting, but secondary
results.

In an effort to focus efficiently on the major questions involved
in the research, certain main analyses were given priority and are
presented in this chapter. These analyses speak to the measurement
and methodological questions presented in the Introduction. Several
supplementary analyses have been performed, but are reserved for brief
mention in Chapter V.

This chapter begins with a brief presentation of some internal
considerations of the measurement methods. These include appropriate
aspects of measurement reliability. The correspondence of the Inter-
personal Adjective Checklist to a circumplex model of personality
is presented. For the products, the comparability of the present
study to earlier work by Hackman is considered.

The main data analyses are rather global considerations of blocks
of data. Convergent and discriminant validity of the measurement
methods are examined for selected measurement occasions. The complete
multitrait-multimethod matrices for eight occasions are presented.
The main effects of time on each of the three traits are presented
as measured by each of the methods. Similarly, the main effects of
stress on each of the three traits are presented as measured by each
of the methods. The written products were also analyzed for effects
of time and stress.


Internal  Analyses 
of  the  Measurement  Methods 


Ratings 

Pairs of questions were combined from the rating forms to yield
measures of influence, affect, and activity. While no attempt was made
to sample identical behaviors within each pair of questions, the
relationship between questions in a pair is of interest.

The extent of agreement between any pair of questions varied
from one measurement occasion to another. A systematic sample (viz.,
for the first segment of the eight experimental hours) of the inter-
correlations is presented in Table 3.

The two influence items correlated strongly for self ratings,
peers, and observers. The strength of the correlation is lowest for
self, next for peers, and highest for observers. The two affect items
were very strongly correlated for all types of ratings, although
somewhat less so for observers than for self or peer ratings. The two
activity items were not as strongly related as the other pairs, although
the mean correlations indicate significant positive relationships for
all types of ratings. The activity items appear to have been more
strongly related for the observers than for the subjects.

The differences in some of these correlations (between self,
peers, and observers) seem to indicate that the rating scales had
different meanings for different raters. This is most strikingly
true in the case of the activity items. These differences exist in
spite of the fact that the wording of the items and the instructions
given were identical for all three types of ratings.


TABLE 3

CORRELATIONS BETWEEN PAIRS OF
RATING QUESTIONS FOR SELECTED MEASUREMENT OCCASIONS

                          Self               Peers             Observer
Occasion^a    N     Inf.  Aff.  Act.    Inf.  Aff.  Act.    Inf.  Aff.  Act.
(Trait^b)

    1        62     .58   .94   .33     .68   .93   .49     .66   .85   .64
    5        62     .57   .93   .38     .62   .91   .57     .56   .64   .67
    8        62     .65   .96   .32     .80   .95   .58     .78   .89   .77
   12        62     .59   .94   .29     .67   .96   .52     .78   .81   .65
   15        62     .54   .96   .39     .55   .96   .55     .65   .96   .57
   19        62     .65   .98   .23     .79   .95  -.10     .64   .94   .38
   22        57     .77   .99   .50     .76   .97   .58     .78   .86   .56
   26        57     .50   .99   .49     .42   .94   .51     .79   .97   .58

  Mean              .59   .96   .37     .66   .95   .46     .71   .87   .60

^a These are the first segments of each hour.
^b Key: Inf. = Influence, Aff. = Affect, Act. = Activity.

Interpersonal  Adjective  Checklist 

(a) Conformity to the circumplex model: The Interpersonal
Adjective Checklist is built on a circumplex model of personality
(Leary, 1957). Figure 3 portrays the circumplex model with octants
designated both by letter and by trait.

Table 4 presents the intercorrelations between octants of the
Interpersonal Adjective Checklist generated by the present data.
Adjacent octants did, in general, correlate as intended. The sole
exception is FG-HI. Correlations between opposite octants are not
strongly negative as the Interpersonal Adjective Checklist model
predicts. They tend, rather, to approach zero. Non-adjacent, non-opposite
octants should be correlated at intermediate values, decreasing
around the circle from adjacent to orthogonal to opposite octants and
then increasing back to adjacent. This patterning is strong in Table 4.
Only a few entries break this order, none of them by more than .05 units.

So, the octants fit a circumplex ordering, with one exception.
However, opposite octants failed to generate negative correlations.
One available explanation for this set of outcomes is that the
"Submissive" (HI) octant and the octants adjacent to it are only
infrequently endorsed by college males (LaForge, 1963). A similarly
infrequent endorsement of submissive characteristics occurred in the
present study. With low frequency of response in these octants there
may be insufficient variability within these octants to allow strong
correlations.
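
The circumplex ordering discussed above can be checked mechanically by grouping the octant intercorrelations according to how far apart the octants lie on the circle. The sketch below is an illustration only: the helper names are hypothetical, the values are the legible entries of Table 4, and the AP-FG entry is left out because it cannot be read in the source.

```python
OCTANTS = ["AP", "BC", "DE", "FG", "HI", "JK", "LM", "NO"]

# Legible lower-triangle entries of Table 4 (AP-FG is unreadable and omitted).
TABLE_4 = {
    ("AP", "BC"): .48, ("AP", "DE"): .35, ("BC", "DE"): .53,
    ("BC", "FG"): .26, ("DE", "FG"): .51,
    ("AP", "HI"): .21, ("BC", "HI"): .01, ("DE", "HI"): -.05, ("FG", "HI"): -.02,
    ("AP", "JK"): .16, ("BC", "JK"): -.05, ("DE", "JK"): -.10,
    ("FG", "JK"): -.02, ("HI", "JK"): .51,
    ("AP", "LM"): .34, ("BC", "LM"): -.01, ("DE", "LM"): -.07,
    ("FG", "LM"): -.11, ("HI", "LM"): .33, ("JK", "LM"): .50,
    ("AP", "NO"): .39, ("BC", "NO"): .01, ("DE", "NO"): -.09,
    ("FG", "NO"): -.02, ("HI", "NO"): .30, ("JK", "NO"): .54, ("LM", "NO"): .68,
}


def circular_distance(a, b):
    """Steps between two octants around the circle (1 = adjacent, 4 = opposite)."""
    d = abs(OCTANTS.index(a) - OCTANTS.index(b))
    return min(d, len(OCTANTS) - d)


def mean_by_distance(correlations):
    """Mean correlation for each circular distance between octants."""
    groups = {}
    for (a, b), r in correlations.items():
        groups.setdefault(circular_distance(a, b), []).append(r)
    return {d: sum(rs) / len(rs) for d, rs in sorted(groups.items())}


means = mean_by_distance(TABLE_4)
```

With these entries the mean correlation declines steadily from adjacent to opposite octants, while the mean for opposite octants stays near zero rather than turning negative, which is the pattern the text describes.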


Figure 3. Circumplex Model of Personality
(after Leary, 1957)

[Figure not reproduced. Labels legible in the original: octants
NO Responsible, Competitive, LM Cooperative, Aggressive, JK Docile,
FG Skeptical; axes Dominance-Submission and Hate-Love.]


TABLE 4

INTERCORRELATIONS BETWEEN OCTANTS
OF THE INTERPERSONAL ADJECTIVE CHECKLIST
N=5?

Octant     AP     BC     DE     FG     HI     JK     LM     NO

AP        1.00
BC         .48   1.00
DE         .35    .53   1.00
FG         .??    .26    .51   1.00
HI         .21    .01   -.05   -.02   1.00
JK         .16   -.05   -.10   -.02    .51   1.00
LM         .34   -.01   -.07   -.11    .33    .50   1.00
NO         .39    .01   -.09   -.02    .30    .54    .68   1.00


(b) Reliability: Because of the interlocking relationship
between the present study and that of Rosemary Lowe,* it was possible
to examine the Interpersonal Adjective Checklist for both test-retest
and split-half reliability. At the end of the Lowe study all Ss
took a shortened, one-page form of the ICL which included half, although
not a perfectly representative half, of the ICL items. This one-page
form became the first page of the full two-page ICL administered to
Ss who participated in the present study. Thus, test-retest reliability
can be assessed from Ss' responses in the Lowe study vs. responses in
the present study. Table 5 presents these test-retest reliabilities,
by octant of the ICL.

Split-half reliability was calculated by comparing Ss' responses
on page one of the ICL with those on page two within the present study.
Table 6 presents these split-half reliabilities.

*See appended note on the relationship of this study to that of
Rosemary Lowe.


TABLE 5

TEST-RETEST RELIABILITIES OF THE ICL

Octant     AP     BC     DE     FG     HI     JK     LM     NO

           .63    .4?    .50    .58    .44    .61    .68    .60


TABLE 6

SPLIT-HALF RELIABILITIES OF THE ICL

Octant     AP     BC     DE     FG     HI     JK     LM     NO

           .57    .5?    .55    .46    .45    .24    .69    .63


The reliabilities were understandably attenuated by comparison
with the published reliabilities of the ICL. Test-retest reliabilities
were attenuated by (1) the two- to four-month lag between test and
retest, an abnormally long interval for reliability testing, and
(2) the fact that only half of the test could be used in this reliability
calculation. Split-half reliabilities may have been attenuated because
the two halves were not balanced. Thus, the reliabilities presented
could well be considered as lower bounds of reliability of the octants
of the ICL.

It must be remembered that any test, but especially a test based
upon an interpersonal personality theory, is affected by the situation
in which it is administered. This interaction of the test with the
situation is customarily interpreted as unreliability. However, the
test may in fact be reflecting the changing situation. This, and other
possibilities, will be considered in Chapter V.




Event Records

For the event records, reliability may be considered virtually
perfect. The events recorded were talking, looking at (line of regard),
various body movements, and facial expression. Tabulations of talking
were simply a matter of noting who was speaking and (later) counting
the number of times each person talked during a 15-minute period.
Given clerical accuracy, these values are numerically precise.

Looking at (line of regard), body movements, and facial expression
were taken from the films. Experience and spot-checking demonstrated
that independent film viewers could agree virtually perfectly on these
variables. Once these events were recorded, the scoring system left
no room for interpretation or judgment errors.

Products 

Of major importance is the reliability among judges who rated
the written products. The ratings of each rater on a set of products
were intercorrelated. Correlations between pairs of raters were then
pooled and transformed according to the Spearman-Brown prophecy
formula to give a reliability estimate of the pooled judgments of the
raters. The reliabilities of the pooled judgments are presented in
Table 7.

The reliabilities range from .80 to .99, based upon six pairs
of raters for each scale. These reliabilities indicate high agreement
among raters and compare favorably with rater reliability in Hackman's
earlier work in this area (Hackman, 1966).
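
For readers who want the step from pairwise rater correlations to a pooled-judgment reliability spelled out, a minimal sketch of the Spearman-Brown prophecy formula follows. The formula itself is standard; the assumptions here are that "pooling" means a simple mean of the pairwise correlations and that three judges contribute to each pooled judgment. The function names and example values are hypothetical.

```python
def spearman_brown(r, k):
    """Reliability of a composite of k parallel measures.

    r is the reliability (here, the mean inter-rater correlation) of a
    single measure; the prophecy formula is k*r / (1 + (k - 1)*r).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    return k * r / (1 + (k - 1) * r)


def pooled_reliability(pairwise_correlations, n_judges):
    """Pool pairwise rater correlations, then step up to n_judges.

    Assumption: pooling is a plain arithmetic mean of the correlations.
    """
    mean_r = sum(pairwise_correlations) / len(pairwise_correlations)
    return spearman_brown(mean_r, n_judges)


# Hypothetical pairwise correlations among three judges on one scale.
example = pooled_reliability([0.4, 0.5, 0.6], n_judges=3)
```

The design point is that a pooled judgment is more reliable than any single judge: a mean pairwise correlation of .50 steps up to .75 for three judges.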

Scores comparable to those in Hackman's study were achieved by
Ss working under normal-load conditions. Table 8 presents Hackman's
product scores for each task type and dimension. Parallel columns
present mean product scores achieved under normal-load conditions in
the present study.


TABLE 7

RELIABILITIES OF THE
POOLED JUDGMENTS OF PRODUCT RATERS
N=6

                                      Scale
Dimension                      I       II      III

Action Orientation            .97     .96     .92
Length                        .99      a      .97
Originality                   .86     .91     .91
Outlook                       .86     .83     .91
Quality of Presentation       .93     .87     .83
Involvement                   .80     .87     .90
Adequacy^b                    .85
Creativity^b                  .84

^a This scale is operational in (remainder illegible).
^b Adequacy and Creativity are represented by only one scale each.


TABLE 8

COMPARISON OF MEAN PRODUCT SCORES
BETWEEN HACKMAN'S (1966) STUDY AND THE PRESENT RESEARCH

                                             Means
                            Production        Discussion      Problem Solving
Dimensions               Hackman  Stapert  Hackman  Stapert  Hackman  Stapert

Action Orientation         2.75     2.34     3.95     3.43     5.89     6.13
Length                     4.73     4.71     3.59     2.72     3.18     2.91
Originality                4.76     4.32     2.72     2.47     ?.49     2.32
Outlook                    3.88     3.97     4.?3     3.93     4.64     4.32
Quality of Presentation    5.07     5.??     4.38     4.73     3.77     4.47
Involvement                3.06     2.23     5.39     4.88     3.93     3.62
Adequacy                   5.11     4.9?     4.97     4.39     5.10     4.90
Creativity                 3.09     4.31     2.60     ?.??     ?.??     ?.??

Inspecting Table 8 shows the consistency between Hackman's
work and the present study. More striking than the proximity of
paired mean values is the patterning of the means. This table shows
Action Orientation to be the characteristic scale for problem-solving
tasks, Originality for production tasks, and Involvement for discussion
tasks for both studies. All indications are, therefore, that ratings
made on the products are reliable and that a high level of comparability
exists between the present study and earlier work on these laboratory
tasks.


Convergent and Discriminant Validity


Convergent and discriminant validity of the measurement methods
may be examined in the multitrait-multimethod matrix. The matrix for
the first measurement occasion (the first segment, first hour, first
week, normal load)* is presented in Table 9. Several features of
this matrix may be noted. (The major diagonal, reliability coefficients,
of this and other matrices is omitted, since different types of
reliability are relevant to different methods. Reliabilities are discussed
in the earlier section of this chapter.)

The Interpersonal Adjective Checklist scores failed to show any
pattern or correlate well with any other method. The failure of this
test to converge with other methods was a consistent finding throughout
the several other multitrait-multimethod matrices presented below.
This is not surprising, since the relatively low reliabilities of the
ICL, already noted, represent upper limits for all the correlations
in the multitrait-multimethod matrix.


*Segment refers to a period of time during which a group worked
on a task and at the end of which ratings were taken. The length of
a segment was variable during normal-load hours, beginning and ending
when a group began and ended a task. During the stress hours, segments
were 15 minutes long.


TABLE 9

MULTITRAIT-MULTIMETHOD MATRIX
FOR OCCASION 1   N=62

[Matrix not present in the source text.]

Convergent validity among the other methods ranges from near-zero
to moderate. Convergence of methods for affect is very weak in
most cases. Only peer ratings and event records show a significant
multimethod convergence for affect (.31).

Moderate convergence on influence and activity is shown between
methods at several points. Self ratings, peer ratings, and observer
ratings converge on these traits. But only the observer ratings show
significant convergence with the event records. So, there seems to
be convergence between some methods on some traits, but not between
all pairs of methods nor on all traits.

If a measurement method is valid, it must not only converge with
other measures of the same trait, but must discriminate between that
trait and other traits. An inspection of the heterotrait-monomethod
values in Table 9 shows that all three rating methods, especially
self and peer ratings, fail to discriminate between traits.

Within each rating method, influence and activity are highly
correlated. This is not surprising. The high correlations are partly
artifactual, since "talking" counts both as influence and as activity.

In addition, correlations as high as .56 stand between influence
and affect and between affect and activity within the rating methods.
These high correlations are signals of method variance, generated when
different traits are measured by a single method.

The Interpersonal Adjective Checklist and the event records
discriminated quite well between traits, except for a similar "built-
in" correlation between activity and influence for event records.
For the Checklist this has limited significance, since it failed to
converge for any trait with any other method. But for the event
records, which tend to converge with observer ratings for at least
some traits, the discrimination is important.

Heterotrait-heteromethod correlations, which share neither trait
nor method variance, are expected to be low. Some are low in Table 9, but
others rise into the fifties. The high heterotrait-heteromethod
values occur primarily between influence and activity and are concentrated
in the rating methodologies. This may indicate a close
relationship between influence and activity. It may also indicate
the presence of method variance among rating scales in general, in
addition to method variance within specific ratings (e.g., self-ratings).
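
The three kinds of off-diagonal entry discussed in this section can be told apart purely from the trait and method labels of the two variables being correlated. The sketch below illustrates that classification for a design with three traits and five methods, as in this study; the label strings and function name are hypothetical, and no correlation values from the matrices are used.

```python
from itertools import combinations

TRAITS = ["influence", "affect", "activity"]
METHODS = ["self", "peer", "observer", "checklist", "event"]


def classify(a, b):
    """Name the multitrait-multimethod block for a pair of variables.

    Each variable is a (trait, method) tuple.
    """
    same_trait = a[0] == b[0]
    same_method = a[1] == b[1]
    if same_trait and same_method:
        return "reliability diagonal"
    if same_trait:
        return "monotrait-heteromethod"    # convergent validity values
    if same_method:
        return "heterotrait-monomethod"    # inflated by method variance
    return "heterotrait-heteromethod"      # expected to be lowest


variables = [(t, m) for m in METHODS for t in TRAITS]

# Count how many distinct correlations fall in each block.
counts = {}
for a, b in combinations(variables, 2):
    kind = classify(a, b)
    counts[kind] = counts.get(kind, 0) + 1
```

In a design of this size there are 30 validity values, 15 heterotrait-monomethod values, and 60 heterotrait-heteromethod values, so most of the matrix consists of correlations that should be low if the methods discriminate between traits.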

Multitrait-multimethod matrices for the first measurement occasion
for each experimental hour are presented in Tables 10 through 16.
Inspection of these tables indicates that, while specific values vary
from table to table, the patterns noted in Table 9 seem to be quite
consistent from one occasion to another.

The artifactually inflated correlations between influence and
activity, noted earlier, may have obscured convergence and discrimination
among the methods. This artifact was removed for a subsequent analysis
by using only question 2 on the rating forms as the measure of activity
for those methods, and by using an activity score without talking for
the event records. Tables 17 through 24 present the multitrait-
multimethod matrices generated by this second analysis.

As expected, all correlations involving activity as a variable
are lower than in the earlier analysis. Many of the correlations in
the previous analysis, which were statistically significant and
appeared to show moderately high convergences among the rating
methods, show insignificant correlations when the activity-influence
overlap is removed.

A case in point is found in Segment 1 (Tables 9 and 17).
In Table 9, which includes the artifact, self ratings and peer ratings
correlate .48, a significant and moderately strong convergence. But
in Table 17, which has eliminated the artifact, the corresponding
correlation is .19, which is not significantly different from zero.
Numerous other cases in this and other pairs of matrices show the same
pattern.

In this second analysis, there is still some convergence between
self and peer ratings for influence and activity. Observer ratings
and event records converge strongly for influence, but not activity.
The validity of self and peer ratings is still blemished, however, by
high method variance (high heterotrait-monomethod values). The
observer ratings discriminate somewhat better and the event records
discriminate very well.

It is worth noting that an unusual operational definition of
activity had to be invoked in order to eliminate the artifactual
relationship between influence and activity. Activity in an interpersonal
situation is frequently defined in terms of talk time or
communication rate. But here, activity came to mean bodily movements
only: writing, change of posture, gestures, fiddling with pens,
nervous twitches, etc. Thus, while this second multitrait-multimethod
analysis exposes the lack of convergence among measurement methods
more clearly than the first analysis, its definition of the activity
trait is less closely related to previous research.

The artifact could have been removed by eliminating talking
from influence, rather than from the activity scores. This would
have left activity conceptually sound: talking and other motor
actions. But it would have defined influence as not including
effects of verbal communication, again conceptually quite different
from the usual meaning of the influence construct.

In sum, there is some convergent validity in the measurement
methods used. Specifically, self and peer ratings converge for influence
and activity, but not for affect. This convergence is stronger for
influence than activity. Observer ratings converge with self and peer
ratings on influence, but not on affect or activity. Observer ratings
converge with event records when measuring influence. No other methods
converge with event records on any variable. No methods converge
with the Interpersonal Adjective Checklist at any point.

However, the rating methodologies are plagued by method variance.
In general, they fail to discriminate between traits, although the
discrimination between traits is better for observer ratings than
for self or peer ratings. Lack of discriminant validity is further
suggested by the presence of many relatively high heterotrait-
heteromethod correlations.

The event records discriminate quite well between traits.
This discrimination is better for affect than for influence and
activity. They also show statistically significant correlations for
convergences for influence, and in some cases for activity, with
observer ratings. The heterotrait-heteromethod correlations for these
two methods on these two traits tend to be low, relative to heterotrait-
monomethod and heteromethod-monotrait correlations.


Changes Over Time


Behaviors 

The foregoing section of this chapter exposed the widespread
method variance and frequent lack of convergent validity which plague
some common measurement methods in small group research. In spite
of these measurement problems, however, the data seem worth examining
for effects occurring over time. The ratings and event records, which
are associated with specific time intervals during the research,
provide an opportunity to evaluate temporal effects on subjects'
behaviors.

The mean scores (from ratings and event records) for each
observation period were averaged to provide an hour-by-hour score for
each trait-method combination. The mean scores on influence and their
standard errors for the normal-load hours are presented in Table 25.


TABLE 25

MEAN INFLUENCE SCORES FOR NORMAL-LOAD HOURS

                                            Week
                          I               II              III              IV
Method               M   est.σM^a    M   est.σM^a    M   est.σM^a    M   est.σM^a

Self Ratings        4.88    .81     4.??    .??     ?.??    .33     ?.??    .79
Peer Ratings        4.89    .67     4.85    .65     4.90    .55     4.80    .61
Observer Ratings    4.55    .84     4.29   1.08     4.6?    .96     4.53    .98
Event Records       1.63    .69     1.51   1.09     1.84    .83     1.54    .82

^a The reported means were computed as the average of the three mean
scores of the hour. The appropriate index of variation is the standard
error of the mean (est. σM).


A one-way Analysis of Variance (ANOVA) was performed on each
variable, none of them yielding a significant F-ratio for influence.
All statistical comparisons among these means failed to find significant
differences.
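
As a reminder of what the F-ratio in a one-way ANOVA measures, the following sketch computes it from first principles. It is a plain between-groups calculation on hypothetical scores; the source does not say whether the repeated measurement of the same subjects across weeks was modeled, so this should not be read as a reconstruction of the analysis actually run.

```python
def one_way_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    groups is a list of lists of scores, one list per condition
    (for example, one list per week).
    """
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n
    group_means = [sum(g) / len(g) for g in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    # Variation of individual scores around their own group mean.
    ss_within = sum(
        (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
    )

    df_between = k - 1
    df_within = n - k
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# Hypothetical scores for three conditions.
f_ratio, df_b, df_w = one_way_f([[1, 2, 3], [2, 3, 4], [3, 4, 5]])
```

The resulting F would then be compared with the critical value for the given degrees of freedom; a non-significant F, as reported here for influence, means the week-to-week differences in means are small relative to the variation within weeks.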

A similar presentation of affect scores is contained in Table 26.
Concerning affect, ANOVA results show a significant change over time
for observer ratings. Specifically, the final week is significantly
higher (p<.01) than any (and all) other weeks. None of the other
measurement methods showed a significant change over time. The self
ratings and peer ratings parallel the rise of the observer ratings
during the final week, although not sharply enough to be statistically
significant.


TABLE 26

MEAN AFFECT SCORES FOR NORMAL-LOAD HOURS

                                            Week
                          I               II              III              IV
Method               M   est.σM^a    M   est.σM^a    M   est.σM^a    M   est.σM^a

Self Ratings        5.00    .60     5.12    .66     5.07    .77     5.19    .03
Peer Ratings        5.12    .48     5.10    .41     5.14    .55     5.20    .65
Observer Ratings    4.52    .49     4.4?    .??     4.50    .51     4.96    .84
Event Records       ?.??    .27     0.11    .?3     0.15    .19     0.15    .23

^a The reported means were computed as the average of the three mean
scores of the hour. The appropriate index of variation is the standard
error of the mean (est. σM).

The means over time for activity scores and their standard errors
are presented in Table 27. ANOVAs for self ratings, peer ratings, and
event records all show significant changes over time in activity level.
Specifically, the self ratings show a significant (p<.05) rise between
weeks II and III. This is paralleled by a significant (p<.02) rise
in peer ratings at the same point. The rise in event records between
the second and third weeks is significant at the .01 level. Only the
observer ratings, which present a different pattern in mean values over
time, fail to show a significant rise between weeks II and III.

TABLE 27

MEAN ACTIVITY SCORES FOR NORMAL-LOAD HOURS

                                            Week
                          I               II              III              IV
Method               M   est.σM^a    M   est.σM^a    M   est.σM^a    M   est.σM^a

Self Ratings        4.25    .82     4.30    .80     4.46    .81     4.53    .82
Peer Ratings        4.18    .64     4.22    .68     4.46    .63     4.44    .67
Observer Ratings    4.30    .84     3.97   1.01     4.27    .99     4.24    .96
Event Records       1.17    .41     1.12    .40     1.35    .43     1.27    .48

^a The reported means were computed as the average of the three mean
scores of the hour. The appropriate index of variation is the standard
error of the mean (est. σM).


Thus, there are differences through time in the behaviors of
small groups. Affect seems to rise during the fourth session, and
activity clearly rises between the second and third sessions. In
the case of affect, the observer ratings show a significant rise
which is paralleled by a similar, though non-significant, rise in
self and peer ratings. In the case of activity, three of the four
methods show a significant rise between the second and third weeks.

Performance

In addition to changes in behavior, groups may change in their
output characteristics over time. The products generated from the
normal-load hours were analyzed by Analysis of Variance to assess the
effects of time upon group output. The three task types were differentially
affected by time, so the data for each task type are presented
separately. The relations of the product dimensions to time for
production, discussion, and problem-solving tasks are presented,
respectively, in Tables 28, 29, and 30.

All three task types changed significantly in Action Orientation
over time. Inspection of the means shows that production tasks were
lower in Action Orientation during the second and third weeks than
during the first or last. Discussion tasks rose steadily in Action
Orientation over the four sessions. Problem-solving tasks were higher
in Action Orientation during the last two weeks than during the first
two. So, while all task types differed in Action Orientation over
time, they differed in different ways. Action Orientation is the
only scale which showed significant time effects for problem-solving
tasks.

Production and discussion groups changed significantly over
time with respect to Originality and Involvement. On Originality,
inspection of the means shows that production groups decreased over
the first three sessions, but then increased during the fourth session.
Discussion groups remained virtually constant in Originality for the
first three weeks and increased during the fourth week.

Inspection of means shows that Involvement increased sharply
during Session II for production groups, then dropped to approximately
its initial level during Sessions III and IV. For discussion groups,
however, Involvement increased steadily over the four weeks.


TABLE 28

MEAN SCORES FOR PRODUCT DIMENSIONS FOR FOUR
SESSIONS AND F-RATIOS BETWEEN SESSIONS (PRODUCTION GROUPS)

[Table values not recoverable from the scanned source.]


TABLE 29

MEAN SCORES FOR PRODUCT DIMENSIONS FOR FOUR
SESSIONS AND F-RATIOS BETWEEN SESSIONS (DISCUSSION GROUPS)

[Table values not recoverable from the scanned source.]


TABLE 30

MEAN SCORES FOR PRODUCT DIMENSIONS FOR FOUR
SESSIONS AND F-RATIOS BETWEEN SESSIONS (PROBLEM-SOLVING GROUPS)

[Table values not recoverable from the scanned source.]

Discussion groups alone were significantly affected on
Outlook and Creativity. Inspection shows that Outlook was considerably
more positive during the second week than at any other session. After
the second week it dropped to, and below, its initial level. Creativity
increased over all four sessions for discussion groups.

In summary, all product dimensions are affected by time except
Length and Adequacy. But only Action Orientation is affected in all
three task types. When a dimension changes over time in two or three
task types, it changes in a different way for each. There are
changes in group performance through time, but the changes appear to
be complex.

Changes Under Stress

Behaviors 

Each weekly experimental session contained an hour of normal-
load activity and an hour of stressing work load. During the first
and third weeks the stressing load was an overload. During the second
and fourth weeks it was an underload.

Changes in behaviors from one working condition to another
can be examined on the basis of the ratings and event records.
Table 31 presents, in parallel columns, the means and standard errors
for normal and stress conditions on Influence.

Examination of Table 31 reveals a strong tendency for Influence
behavior to rise under stress conditions. Seven of the comparisons
between normal and stress conditions are significant, with the stress-
hour score higher than the normal-hour score in every case. Of the
remaining nine (insignificant) comparisons, seven means are higher
under stress than under normal conditions.


TABLE 31

COMPARISON OF INFLUENCE SCORES
BETWEEN NORMAL AND STRESS CONDITIONS

                          Normal Condition      Stress Condition
Method            Week    M      est. σM(a)     M      est. σM(a)     t

Self Ratings      I       4.88   .81            5.30   .73            3.00(c)
                  II      4.91   .77            5.00   .99            -
                  III     4.7?   .83            5.14   .76            2.66(c)
                  IV      4.85   .79            4.97   .72            -

Peer Ratings      I       4.89   .67            5.29   .54            3.64
                  II      4.85   .69            5.02   .65            -
                  III     4.90   .55            5.15   .59            2.50
                  IV      4.80   .61            5.04   .57            2.27

Observer Ratings  I       4.55   .84            5.15   .74            4.29(c)
                  II      4.29   1.08           4.61   1.02           -
                  III     4.67   .96            4.58   .87            -
                  IV      4.53   .98            4.67   .91            -

Event Records     I       1.63   .69            1.71   .65            -
                  II      1.51   1.09           1.67   ?              -
                  III     1.84   .83            1.61   .65            -
                  IV      1.54   .82            1.??   .86            2.??

(a) The reported means were computed as the average of the three mean
scores of the hour. The appropriate index of variation is the standard
error of the mean (est. σM).

(b) P < .05
(c) P < .01

[A "?" marks a digit or entry that is illegible in the source. A t value
is shown only for significant comparisons; significance superscripts that
are illegible in the source are omitted.]
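
The direction counts reported for Table 31 (all seven significant
comparisons, plus seven of the nine insignificant ones, favor the stress
hour) amount to 14 of 16 comparisons in one direction. As a rough
illustration only, a two-sided sign test gives the chance of so lopsided a
split if each direction were equally likely. The sketch assumes the 16
comparisons are independent, which they are not strictly (the same subjects
are rated by all four methods), so the result is indicative at best. The
function name `sign_test_two_sided` is a hypothetical illustration, not part
of the original analysis.

```python
from math import comb


def sign_test_two_sided(successes, n):
    """Two-sided sign test probability for `successes` out of `n` trials,
    under the null hypothesis that each direction has probability 1/2."""
    k = max(successes, n - successes)
    # Probability of a split at least this extreme in one direction.
    upper_tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    # Double it for a two-sided test, capping at 1.
    return min(1.0, 2 * upper_tail)


# 14 of the 16 normal-vs-stress comparisons show a higher stress-hour mean.
p_value = sign_test_two_sided(14, 16)
print(round(p_value, 4))
```

Under these assumptions the probability is about .004, which agrees with the
text's reading that the rise in influence under stress is a general tendency
rather than an accident of a few comparisons.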

While  the  tendency  for  influence  to  rise  during  the  stress  hours 
is  general,  the  significant  comparisons  seem  to  cluster  in  the  first 
and  last  weeks.  Three  methods  (self,  peer,  and  observer  ratings) 
indicate  a  significant  rise  during  Week  I.  Peer  ratings  and  event 
records  show  a  significant  increase  in  influence  during  Week  IV. 

No  significant  change  in  influence  is  indicated  during  Week  II, 
although  all  four  methods  show  a  tendency  to  rise  between  normal  and 
stress  hours.  While  both  self  and  peer  ratings  agree  on  a  significant 
rise  during  Week  III,  the  observer  ratings  and  event  records  fail  to 
show any rise at that point. In fact, Week III showed a decrease
(although  insignificant)  in  influence  as  measured  by  observer  ratings 
and  event  records. 

Table 32 presents comparisons between normal and stress hours
on affect behaviors. There is virtually no evidence of change in affect
under stress. The only significant pair of means is the decrease in
affect during Week I as measured by event records. While this lone
significant comparison shows a decrease, the general trend in Table 32
is for affect scores to rise slightly during the stress hours. The
failure to find significant shifts in affect under stress, along with
the failure of all measurement methods to converge when measuring
affect, is discussed in Chapter IV.

TABLE 32

COMPARISON OF AFFECT SCORES
BETWEEN NORMAL AND STRESS CONDITIONS

                          Normal Condition      Stress Condition
Method            Week    M      est. σM(a)     M      est. σM(a)     t

Self Ratings      I       5.0?   .60            5.2?   .84            -
                  II      5.12   .66            5.24   .75            -
                  III     5.07   ?              5.23   .?9            -
                  IV      5.19   .83            5.39   .78            -

Peer Ratings      I       5.?2   .48            5.29   .55            -
                  II      5.10   ?              5.23   .52            -
                  III     5.14   .55            5.22   .53            -
                  IV      5.20   .6?            5.36   .61            -

Observer Ratings  I       4.52   .49            4.68   .61            -
                  II      ?      ?              ?      .49            -
                  III     4.50   .51            4.64   .62            -
                  IV      ?      ?              5.09   .70            -

Event Records     I-IV    [entries illegible in the source]

(a) The reported means were computed as the average of the three mean
scores of the hour. The appropriate index of variation is the standard
error of the mean (est. σM).

(c) P < .01

[A "?" marks a digit or entry that is illegible in the source.]


There is considerable evidence of rise in activity level as
subjects encountered the stressing work load. Table 33 presents
normal vs. stress comparisons in activity level. Self, peer, and
observer ratings all show significant increases in activity between
normal and stress hours for Weeks I, II, and III. Event records
show a significant rise for Week I. The event records also show a
rise, although non-significant, for Weeks II and III. Two of the
four methods, peer ratings and observer ratings, show a rising trend
during Week IV. In no case is there a decrease in activity level
indicated as subjects move from normal to stressing conditions.

The fact that activity increases for underload stress as well as
for overload stress may be worth noting.


TABLE 33

COMPARISON OF ACTIVITY SCORES
BETWEEN NORMAL AND STRESS CONDITIONS

                          Normal Condition      Stress Condition
Method            Week    M      est. σM(a)     M      est. σM(a)     t

Self Ratings      I       4.25   .82            4.68   .83            2.89
                  II      4.30   .80            4.59   .81            1.99
                  III     4.46   .81            4.75   .77            2.03(b)
                  IV      4.53   .82            4.53   .77            -

Peer Ratings      I       4.18   .64            4.60   .57            3.84
                  II      4.22   .68            4.51   .63            ?
                  III     4.46   .63            4.74   .56            2.60
                  IV      4.44   .67            4.4?   .66            -

Observer Ratings  I       4.30   .84            4.83   .79            3.59(c)
                  II      3.97   1.01           4.38   .91            3.33
                  III     4.27   .99            4.63   .?8            2.23(b)
                  IV      4.24   .96            4.53   .97            -

Event Records     I       ?      .41            ?      ?              ?
                  II      1.12   ?              1.39   ?              ?
                  III     ?      ?              ?      ?              ?
                  IV      1.27   .48            1.27   .?0            ?

(a) The reported means were computed as the average of the three mean
scores of the hour. The appropriate index of variation is the standard
error of the mean (est. σM).

(b) P < .05
(c) P < .01

[A "?" marks a digit or entry that is illegible in the source;
significance superscripts that are illegible in the source are omitted.]

Performance

Analyses of the written products show the effects of overload
and underload working conditions on group performance. When the
results of all groups are combined, every product dimension shows
significant effects of the working conditions. Table 34 presents
the relations between product dimensions and working conditions for
all groups.

However, the working conditions had different effects on each
task type. These differences are seen when the groups are divided
by task type for analysis of stress effects on product dimensions.
Tables 35, 36, and 37 present these effects for production, discussion,
and problem-solving groups, respectively.

From these tables it can be seen that the working conditions
affected every task type on Length, Outlook, and Adequacy. Production
and discussion groups were affected on Action Orientation. Production
and problem-solving groups were affected on Quality of Presentation.
And discussion and problem-solving groups were affected on Originality,
Involvement, and Creativity. So, not only are there effects of stressful
working conditions on performance in general, there are different
effects of stress in each task type. These effects are discussed in
Chapter IV.


Summary 

The large number of results presented in this chapter may be
summarized as follows. Internal examination of the various measurement
methods  employed  showed  them  to  be  acceptably  consistent,  reliable 
techniques.  In  the  case  of  the  Interpersonal  Adjective  Checklist,  the 
present  data  conformed  very  well  to  the  circumplex  model  upon  which 
the  test  was  built.  In  the  case  of  the  products,  comparability  to 
Hackman's  earlier  findings  was  outstanding. 

Multitrait-multimethod analyses of convergent and discriminant
validity revealed a large reservoir of method variance in the rating
techniques. The ratings generally failed to discriminate well between
traits. At some points there was convergent validity, with at least
modest discrimination. In spite of its elegant internal features,
the Interpersonal Adjective Checklist failed to converge with any
other measurement method on any trait.

There  were  effects  of  time  and  effects  of  stress  on  group 
behaviors.  Time  and  stress  also  affected  group  products.  In  the  case 
of  the  products,  time  and  stress  had  differential  effects  upon  groups 
of  different  task  types. 

The results presented in this chapter are discussed in Chapter
IV. There is an emphasis in Chapter IV on results which would have been
overlooked or misinterpreted in a less systematic study. The further
discussion in Chapter V places emphasis upon the usefulness and potential
of a systematic research strategy, such as the one which generated the
present body of data.


CHAPTER IV: WILL THE REAL RELATIONSHIP
PLEASE STAND UP?

SOME EXERCISES IN INFERENCE FROM LIMITED EVIDENCE

This research was planned and executed as a methodological
study. As such, the design was to pursue a research strategy which
seemed promising by comparison to more frequently-used strategies.
No hypotheses about the findings were set forth in advance and no
specific substantive findings were sought. The analyses performed
and reported in Chapter III were intended to suggest the
comprehensiveness of the research strategy.

Two related questions determined the form of the present
chapter, which discusses the results obtained: (1) What kinds of
findings, ordinarily overlooked, are made available by this research
strategy? and (2) What kinds of findings, frequently misinterpreted
in other research, are elucidated by the present strategy? This
chapter takes the form of citing selected findings which might have
been obtained and overlooked or misinterpreted in other strategies
of research. Then, the findings are shown, appropriately interpreted,
in the context of the present systematic research.

Relationship Between Influence and Affect: The Method Variance Trap

Suppose that Mr. Brown, social psychologist, hypothesized a
relationship between influence and affect in small groups. Specifically,
he hypothesized that as group members become more influential, they are
better liked. In order to test his hypothesis he studied several small
groups, measuring influence by self-ratings made by each subject and
affect by peer ratings of how likeable each subject was.

On the basis of Table 9, which would include his data from a
single-occasion study, the hypothesized relationship (.40) is significant
(p < .01). He would have concluded that his hypothesis was confirmed.

Now, suppose that a second researcher accepts the relationship
between influence and attraction, but hypothesizes that the relationship
breaks down over time. He performs a two-occasion study on several
groups, using the same measurement methods as Mr. Brown. Values in
Tables 9 and 11 contain the second researcher's potential findings.
The significant (.40) correlation obtained between influence and affect
drops to an insignificant level (-.03) on the second occasion. He
concludes that his hypothesis is confirmed.

As a further supposition, consider a third researcher who
believed Mr. Brown's findings about the significant relationship.
He hypothesized that the findings obtained by the second researcher may
have been due to different levels of stress introduced in his two
experimental sessions. In his experiment, this third researcher used
self ratings of influence, as did his predecessors. But for a more
objective measure of affect, he employed a system comparable to the
event records in the present study.

Tables 9 and 10 contain the third researcher's potential
findings. Under normal working conditions (Table 9) he finds a
significant (p < .05) correlation of .50 between influence and affect.
And during the stressing overload condition (Table 10) the correlation
drops to an insignificant .19. As a result, he concludes that stress,
not time, was the likely cause for the deteriorating relationship
between influence and affect in small groups.

As a finale to this series of experiments, a fourth researcher
decides to follow up previous studies, measuring the relationship
between influence and affect. However, he finds it convenient to
modify the measurement methods somewhat. He allows each person to
rate his own influence in the group, as before. But an impartial
observer, watching the group, rates each member on affect.

Tables 9, 10 and 11 contain the fourth man's potential
findings. On the first, normal-load occasion (Table 9) he fails
to find a significant relationship (r = .17). However, the next hour's
stressing load (Table 10) generates a significant (p < .05) relationship
of .27. A week later (Table 11) he finds that the relationship is again
insignificant (r = .11). His findings contradict all previous findings
of a significant relationship between influence and affect in small
groups under normal working conditions. And his significant finding
during the overload hour reverses another of the previous findings.

What can be said to clarify this tangled series of research
results? In the context of the present design, all correlations
cited are heterotrait-heteromethod values. These values have neither
trait nor method in common and, therefore, should be low. The fact
is that all rating methods used are plagued by method variance in both
normal and stress conditions. And the convergent validity of all
methods when measuring affect was extremely low. The findings obtained
depend primarily upon the methods used, not upon the relationship
supposedly measured. All four of the hypothetical researchers mentioned
above capitalized on method variance and on chance.
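
The four kinds of cell in a Campbell and Fiske multitrait-multimethod
matrix can be made explicit with a small sketch. The function name and the
label strings below are illustrative assumptions, not part of the original
analysis; the example calls correspond to the measurement choices of the
hypothetical researchers described above.

```python
def classify_mtmm_cell(trait_a, method_a, trait_b, method_b):
    """Name the multitrait-multimethod cell for a correlation between
    (trait_a measured by method_a) and (trait_b measured by method_b)."""
    same_trait = trait_a == trait_b
    same_method = method_a == method_b
    if same_trait and same_method:
        return "reliability diagonal"
    if same_trait:
        # Same trait, different methods: evidence of convergent validity.
        return "monotrait-heteromethod"
    if same_method:
        # Different traits, same method: inflated by shared method variance.
        return "heterotrait-monomethod"
    # Nothing in common: should be low if the measures are valid.
    return "heterotrait-heteromethod"


# Mr. Brown and the second researcher: self-rated influence, peer-rated affect.
brown = classify_mtmm_cell("influence", "self ratings", "affect", "peer ratings")
# Third researcher: self-rated influence, affect from event records.
third = classify_mtmm_cell("influence", "self ratings", "affect", "event records")
# Fourth researcher: self-rated influence, observer-rated affect.
fourth = classify_mtmm_cell("influence", "self ratings", "affect", "observer ratings")
# For contrast: the same trait measured by two different methods.
convergent = classify_mtmm_cell("affect", "peer ratings", "affect", "observer ratings")

print(brown, third, fourth, convergent, sep="\n")
```

Every one of the hypothetical studies falls in the heterotrait-heteromethod
cell, which is why none of their correlations can be read as evidence about
the influence-affect relationship without the rest of the matrix.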
Creativity in Groups: Confounding Task Types

Quite a separate stream of research is initiated, let us
suppose, by Mr. Smith. His interest is in the outcome or performance
of small groups on intellective tasks. He hypothesizes that group
performance will improve over time on such tasks. By "improve" he
means "become more creative."

Mr. Smith selects a set of intellective tasks, comparable in
difficulty, and submits them to several groups of subjects. The
groups meet for four consecutive weekly sessions to write responses
to these tasks. Mr. Smith hires and trains "blind" judges to rate
these products for creativity (by operations comparable to those
used in the present study).

Table 29 contains a set of possible outcomes of Mr. Smith's
research. The creativity scores rise from week to week, showing an
overall significant (p < .01) gain in creativity in four weeks. On
this basis, Mr. Smith concludes that his hypothesis is confirmed.
(He may even be tempted by the opportunity to earn a sizeable
supplementary income by "training" groups to become more creative.)

At another university a second researcher, also interested
in group performance, feels that the creativity gain over time should
be enhanced by tasks which stimulate creativity. However, he uses
a different type of task than Mr. Smith, and redefines "goodness"
as "originality" (in a manner similar to the originality ratings
of the present study). He obtains quite different results. Table 28
contains potential results from this second researcher's study. It
shows that originality does change significantly (p < .01) over time.
But originality generally declines, rather than increases, over time.
In contrast to Mr. Smith's results, groups were significantly less
original after four weeks than they had been in the first week.

A third researcher is skeptical of both previous studies. He
questions the operating conditions of the two studies, suspecting
differential stress. To test his suspicions, he designs a study which
includes normal load, overload, and underload conditions, using
discussion tasks in small groups. As dependent measures, he checks both
the originality and creativity of the products.

Table 36 contains the potential findings of the study outlined
above. It shows significant changes in both originality and creativity
under stress. Both tend to decrease with overload and increase with
underload. The conclusion drawn is that stress, not time, is likely
to account for changes in group products.

A fourth researcher, who doubts the "stress" explanation, tries
to replicate some of the foregoing research. He reproduces the normal
load, overload, and underload working conditions of the prior researchers.
And he trains his product judges to rate for originality and creativity.
His findings, potentially contained in Table 35, show no significant
changes due to stress in either originality or creativity.

What has happened in this hypothetical stream of research?
Does time affect group products? Does stress affect group products?
Some answers are available from the present study.

The answers are "Yes, sometimes" for the effects of both time
and stress on group products. Time does affect some types of group
tasks on some rating dimensions, but not all types of tasks nor on all
rating dimensions. Stress also affects products. But task types and
rating dimensions are affected differentially.

What has happened in the hypothetical series of studies
sketched above is that different researchers have unwittingly switched
task types on each other. They have also looked for slightly
different qualities in the products. Their findings appear to be
contradictory because their research was deceptively similar, while
being different. Failing to note subtle but important distinctions
between task types, they generalized a bit too far.

Patterning in Group Process: How Many Points Make a Pattern?

A  third  hypothetical  stream  of  research  might  be  initiated  by 
Mr.  Jones,  investigator  of  small  group  interaction  patterns.  His  interest 
lies  in  the  relationship  between  the  giving  and  receiving  of  influence 
among  group  members.  He  hypothesizes  a  relationship  between  influence 
given  and  influence  received. 

Using an event records system such as the one designed in the
present research, Mr. Jones observes several groups in a single-occasion
study. His result, contained potentially in Table 38, is
only a small (.16) insignificant correlation between influence given
and influence received.

One of Mr. Jones' students pursues the question further, thinking
that the relationship hypothesized may become apparent through time.
He studies several groups, each of which has weekly one-hour sessions.
His results (the normal-load entries of Table 38) confirm his hypothesis.
Indeed, the fourth session shows a significant (p < .05) positive
correlation of .26.

An  industrial  psychologist,  upon  reading  of  these  results, 
considers  their  relevance  for  committees  in  his  corporation.  However,  he 
suspects  that  working  conditions  may  affect  the  relationship. 

The industrial psychologist thus designs a study which varies
the working conditions in a pattern comparable to that in the present
study (Figure 2). His potential results include all of Table 38.
From these results he concludes that stress (both overload and underload)
strengthens the relationship between influence given and influence
received. He also notes that time strengthens the relationship in
both stress and non-stress conditions.


TABLE 38

CORRELATIONS BETWEEN INFLUENCE GIVEN AND
INFLUENCE RECEIVED FOR SELECTED OCCASIONS

Week    Occasion    Condition      r      N     p

I       1           Normal Load    .16    62    ns
        5           Overload       .26    62    <.05
II      8           Normal Load    .18    62    ns
        12          Underload      .26    62    <.05
III     15          Normal Load    .12    62    ns
        19          Overload       ?      62    <.01
IV      22          Normal Load    .26    57    <.05
        26          Underload      .51    57    <.01

[A "?" marks an entry that is illegible in the source.]
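
The significance levels in Table 38 can be checked approximately with the
standard t transformation of a product-moment correlation,
t = r * sqrt((N - 2) / (1 - r^2)), with N - 2 degrees of freedom. The sketch
below assumes a two-tailed test; the report does not state which test was
used, and the function name `t_from_r` is a hypothetical illustration.

```python
import math


def t_from_r(r, n):
    """t statistic for testing a Pearson correlation r, based on n pairs,
    against a true correlation of zero (n - 2 degrees of freedom)."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))


# Two-tailed .05 critical value of t for 60 degrees of freedom,
# taken from standard tables.
T_CRIT_05_DF60 = 2.000

t_normal = t_from_r(0.16, 62)    # Week I, normal load
t_overload = t_from_r(0.26, 62)  # Week I, overload

print(round(t_normal, 2), t_normal > T_CRIT_05_DF60)
print(round(t_overload, 2), t_overload > T_CRIT_05_DF60)
```

With 62 subjects, a correlation of .16 falls well short of the critical
value while .26 only just exceeds it. The line between "ns" and "<.05" in
Table 38 is therefore a narrow one, which is worth keeping in mind when
reading trends into single-occasion values.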

In  fact,  the  stream  of  research  outlined  above  may  have 
arrived at mistaken conclusions at several points. While the
data  drawn  from  the  present  study  and  contained  in  Table  38  are  legitimate, 
they  do  not  present  the  full  picture.  When  several  behavior  samples  are 
taken  during  each  experimental  hour,  rather  than  just  one,  the  picture 
becomes  much  more  complex.  The  trends  which  seemed  clear  earlier  then 
vanish.  The  full,  segment-by-segment  picture  is  contained  in  Table  39. 

This Table shows great variability in the relationship between
influence given and influence received. The relationship varies widely
within each experimental hour, regardless of experimental conditions,
until the final week. During Week IV the relationship is significantly
positive throughout.

This chapter has aimed, in a somewhat unusual fashion, to
demonstrate the usefulness of collecting a systematic body of data on
several traits, by multiple methods, through time. Many
thoughtfully-conceived, carefully conducted studies run great risks of
overlooking important findings or misinterpreting observed findings.
In part, these risks exist because researchers use single methods of
measurement, rather than multiple methods, in each study. Or, they
measure only one trait. As an alternative, they may measure several
traits by several methods, using one method per trait.

Other sources which contribute to the risk are the failure to
conduct studies through time and under varied working conditions.
In addition, and as was shown in the present study, differences in
group tasks can easily lead different researchers to opposite conclusions.


TABLE 39

CORRELATIONS BETWEEN INFLUENCE GIVEN AND
INFLUENCE RECEIVED FOR ALL OCCASIONS

[Table body not present in the source text.]

The several hypothetical "research programs" used here for
illustrative  purposes  are  by  no  means  unrepresentative  designs.  Indeed, 
they  were  chosen  because  they  represent  rather  typical  research  approaches 
and  substantive  questions  in  the  small  group  research  area. 

Single-relationship studies make up far more than a majority
of the small group research field (McGrath and Altman, 1966).
Furthermore, some sizeable bodies of "findings," which now have the
status of "established principles," are based on relationships between
two variables, each measured by the same, single method (most often,
the ubiquitous self-report). The first illustration in this chapter
shows how easily one can be tempted to over-interpret partial data, in
directions that turn out to be systematically in error.

The general problem of disregard for task properties in studies
of group performance has been discussed at length in a number of
sources (e.g., Shaw, 1964; Hackman, 1966; McGrath and Altman, 1966).
The more subtle but equally widespread problem of variations in
"criterion" of group task performance effectiveness from one study to
another has also been noted in the literature (e.g., Hackman, Jones and
McGrath, 1967). The second "illustrative research program" described in
this chapter exemplifies, from a body of systematic empirical data, how
these task-related pitfalls can lead the investigator astray in the
group performance research area.

Many theorists and researchers have emphasized the need for
studies of patterning in group process through time. Very little work
has been done on this topic. The third "illustrative research program"
described here suggests some of the reasons why there has been such
limited effort in this area. Even the patterns of relationships between
two quite similar variables (influence given and influence received)
over time turn out to be very complex. Perhaps some important general
relationships can be discovered by ignoring some of the fluctuations
through time (as does Table 38). But this implies that somehow the
data from time-points selected for inclusion are "true," and data from
the omitted time-segments are "error." That is a very strong assumption
for an investigator to make in an area which is so largely unexplored.

Data presented in Table 39 give one pause regarding any straightforward
inferences about the nature of the relationship between these two variables
as a function of calendar time, task time, practice, or stress condition.
Table 39 does make clear, though, that we cannot afford the "classic"
one-occasion study if we are to learn about patterning of variables in
group interaction.

There are a number of other "classical" research designs and
problems from the small group research area which could have been
selected for illustrative purposes here, and for which the present
study contains a systematic set of relevant data. For example, this
study has evidence (via several methods of measurement) on establishment
and changes in "system balance" (regarding communication, attraction,
and influence) in interpersonal systems. As another example, the study
contains data which can be used to explore systematically the problem
of phases in group development over time, again via several methods of
measurement. These and other questions will be explored in later analyses.

But the main purpose here is to illustrate the potentialities
that flow from the systematic approach to data collection which this study
has followed and, incidentally, to show that such a systematic strategy
is feasible to carry out within relatively limited resources.


CHAPTER V: FURTHER DISCUSSION AND SUMMARY


Chapter  IV,  above,  illustrated  the  usefulness  of  a  systematic 
approach  to  research  in  small  groups  by  exposing  some  types  of  errors 
which haunt other more restricted research strategies. The presentation
of this final chapter is more positive, discussing some of the
methodological implications of the findings of this research and
pointing  to  potential  gains  available  from  systematic  research  approaches. 

This  chapter  begins  with  some  reflections  on  the  measurement 
methods  used  in  the  present  study.  Second,  there  is  a  brief  discussion 
of  the  traits  (behaviors)  which  were  measured.  And  third,  some  of 
the advantages of the systematic research are cited. These advantages
are seen to accrue from both the multi-measure and multi-occasion features
of  the  present  research.  The  chapter  closes  with  a  brief  summary. 

Discussion  of  Specific  Methods  of  Measurement 

Interpersonal  Adjective  Checklist 

The  Interpersonal  Adjective  Checklist  displayed  satisfactory 
internal  characteristics.  Its  circumplex  ordering  was  virtually  perfect. 
It discriminated clearly between the traits measured, including the
"activity" trait for which it was not originally intended as a measure,
and had reasonable reliability under the circumstances.

Yet the Checklist failed to converge with any other measure on
any of the traits. Its correlations with the other methods approach
zero at every point. Even its correlations with the results of ratings
and event records which were obtained during the same hour as the
administration of the Checklist were near-zero. Thus, although the ICL
had fairly attractive internal features and the least method variance, it
was weakest in convergent validity.

Two  comments  stemming  from  this  experience  with  the  Checklist 
seem in order. First, it illustrates the severe constraints under which
personality assessors must labor. Even the reliabilities, which by
comparison to other available personality tests are considered good,
place strict limits upon the measurement of personality traits. If
the reliability of a test over a 2- to 4-month interval is .50 or .60,
as it was for the Checklist in the present study, its validity can be
no higher than .50 or .60. The fact seems to be that personality
traits are very difficult to measure reliably, to say nothing of measuring
validly.  And  if  the  multimethod  convergent  validity  coefficients  in 
this  study  are  any  indication,  validity  is  a  scarce  commodity  in  much 
of  our  measurement. 
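
The ceiling that reliability places on validity can be made concrete with
a small calculation. The sketch below assumes the classical test theory
model, under which the observed correlation between a test and a criterion
cannot exceed the square root of the product of their reliabilities. That
bound is somewhat more lenient than the reliability figure itself, but it
is still low for reliabilities in the fifties and sixties. The function
name `validity_ceiling` is hypothetical and is not part of the study.

```python
import math


def validity_ceiling(reliability_test, reliability_criterion=1.0):
    """Upper bound on an observed test-criterion correlation.

    Assumes the classical test theory attenuation relation:
    r_observed <= sqrt(reliability_test * reliability_criterion).
    """
    for value in (reliability_test, reliability_criterion):
        if not 0.0 <= value <= 1.0:
            raise ValueError("reliabilities must lie between 0 and 1")
    return math.sqrt(reliability_test * reliability_criterion)


# Reliabilities of the size reported for the Checklist, paired with a
# (hypothetically) perfectly reliable criterion.
for r in (0.50, 0.60):
    print(f"reliability {r:.2f} -> ceiling {validity_ceiling(r):.2f}")
```

When the criterion is itself measured with a reliability of .60, the same
function gives a ceiling of .60, which matches the figure quoted above.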

The second comment emanating from this experience with the
Checklist is that personality traits may be situational. That is,
"personality" may not consist of internal properties of the person
which are invariant across time and place. Rather, it may consist of
interactions of the person and the situation in which he finds himself.
This implies that part of what is measured as a "personality trait"
actually "exists in" the measurement situation.

If so, then a reliability coefficient in the fifties or sixties
across a 2- to 4-month time interval would seem quite encouraging.
After all, the situation in which the testing took place was somewhat
different on the two occasions, and this would account for a moderate
(as opposed to high) reliability. This explanation would be attractive
if the Checklist had correlated well with other measures taken during
the same hour, even if it had correlated less well with measures
taken at more distant times. This, unfortunately, was not the case.
So there is no evidence in this study that the Interpersonal Adjective
Checklist was sensitive to the particular setting in which it was
administered.

Ratings 

All  evidence  in  the  present  study  points  to  an  enormous  method 
variance in all rating methodologies, particularly self ratings and
peer ratings. It would appear that an experimenter could find a
significant positive relationship between virtually any pair of variables,
provided that he measured both by a rating methodology. In the light
of the observation of Webb et al. (1966), cited earlier, that ratings
and questionnaires dominate social science as measurement devices,
this  finding  casts  serious  doubt  on  a  number  of  established  relationships 
in  the  field  which  have  been  measured  primarily  by  ratings  and  which  thus 
become  suspect. 

In fairness to the observers, it must be pointed out that the
observer ratings were plagued somewhat less by method variance than
self or peer ratings. The observer ratings also showed some convergent
validity with the event records, an achievement not attained by the
other  rating  methods.  Thus,  data  from  the  observer  fared  a  bit  better 
than  data  from  subjects  in  regard  to  both  discriminant  and  convergent 
validity. 

Event  Records 

The event records system of observation designed in this
research deserves further exploration and development. By comparison
to other observational systems (e.g., Bales, 1950b) it is a low-
inference  system.  In  addition,  the  inferences  in  this  system  are 
explicit,  rather  than  hidden  within  the  observer  or  the  training 
process. 

Potentially,  the  event  records  system  may  be  exploited  as  a 
technique  for  measuring  a  wide  variety  of  variables.  Many  of  the 
behavioral  indices  reviewed  by  Argyle  and  Kendon  (1967)  and  especially 
those  of  Sommer  (1959,  1961,  1962,  1965)  could  be  taken  with  great 
accuracy  by  this  method. 

Although  the  event  records  as  taken  in  the  present  study 
involved  the  placement  of  a  camera  in  the  experimental  room,  operated 
directly  by  the  observer,  this  need  not  have  been  the  case.  The 
methodology,  in  fact,  is  potentially  quite  unobtrusive.  The  camera 
could  have  been  poised  behind  one-way  glass  outside  the  room.  The 
event  records  methodology  is  also  portable,  capable  of  being  carried 
into  a  wide  variety  of  natural  (field)  settings.  In  many  field  settings 
it  is  possible  to  focus  the  camera  on  a  group  of  subjects  from  a 
distance of twenty or thirty yards, using a telephoto lens. This makes
it possible to collect data unobtrusively in a wide range of open settings.
Thus, the event records methodology holds considerable potential as an
unobtrusive, flexible, portable system for data collection of behaviors
in small groups.

Improvement of the event records system seems most likely to
come from the use of videotape devices. The movie cameras used in
the present study could have been replaced by videotape cameras.
Videotaping provides a sound track which is perfectly synchronized with
the visually-observed behaviors. (Currently-available videotape equipment
is nearly as portable as a movie camera for use in field studies
and has no more lighting requirements than black and white films.)

A videotape apparatus was used as a training device for some
observers in the present study. This brief experience demonstrated
the feasibility of making videotapes of small group behaviors for
subsequent  analysis.  It  also  demonstrated  that,  while  the  cost  of 
videotape  equipment  is  high,  once  it  is  acquired  the  cost  of  its  use 
for  data-collecting  sessions  is  not  prohibitively  expensive,  and 
use  of  videotape  as  a  training  device  or  mode  for  presentation  of 
stimulus  material  is  relatively  economical. 

Thus,  there  are  features  of  merit  in  the  event  records  system. 
Not all of the possibilities of this system were exploited in the
present study, leaving room for considerable future development.
One of the ways in which the event records system may develop is in
the use of videotape.

Discussion of Specific "Traits" in Interpersonal Interaction

It is difficult to make general statements about the traits
(behaviors) measured in the present research. The general method
variance of the ratings has been noted. This indicates restricted
discrimination between traits for rating methods. Also noted was
the discrimination but non-convergence of traits for the Interpersonal
Adjective Checklist. The event records, while discriminating moderately
well, converged with another method (observer ratings) on only one
trait (influence). These weaknesses of the measurement methods make
it impossible to be confident of any conclusion concerning the traits
as unitary constructs.


Under the original plan of analysis, there appeared to be
convergence among several methods on both influence and activity.
However, the operational definitions of influence and activity,
designed to reflect prevalent conceptions of these traits, connected
them artifactually. When the artifact was removed, for ratings and
for event records, the method variance of the event records and the
convergence of the rating methods both decreased.

The remaining convergence in the multitrait-multimethod
matrices  stood  between  observer  ratings  and  event  records  on  influence, 
at  a  statistically  significant  but  not  overpowering  level.  For  activity, 
convergence  became  very  weak  when  its  overlap  with  influence  had  been 
removed.  And  for  affect,  which  never  enjoyed  anything  but  method 
variance,  there  was  no  convergence  at  any  point. 

One  conclusion  to  be  drawn  from  this  series  of  analyses  is 
that  to  remove  the  method  variance  from  the  measures  is  to  change 
the  concept  being  measured.  The  concepts  available  for  measurement, 
while  widely  used,  are  so  method-bound  that  results  of  their  measurement 
are  primarily  a  matter  of  method  variance.  In  this  case,  method 
variance was decreased by re-defining "activity" in a non-interpersonal
way.  Another  conclusion  to  be  drawn  from  these  analyses  is  that  we 
must  remain  quite  skeptical  as  to  the  convergent  and  discriminant 
validity  of  these  traits,  with  the  possible  exception  of  influence, 
even  though  these  three  traits  have  been  identified  in  the  literature 
as  pre-eminent  variables  in  small  group  interaction. 

Potential Gains From Use of a Systematic Strategy

Gains From a Multi-Measure Approach

The basic multitrait-multimethod argument (Campbell and Fiske,
1959; Webb et al., 1966) is that any measurement consists of a
trait-method complex, which contains a "true" component and some
"error." Different methods of measuring behavior must, by definition,
share the "true" component, but will tend to contain different "error."
For any particular measurement method the errors are likely to pattern
or contain systematic bias regardless of the "trait" being measured.
The use of multiple measures which are subject to different patterns
of error is proposed as a means of separating the "true" trait component
from the method-linked error.
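
The argument can be illustrated with a small simulation. The sketch below
is not the analysis used in this study; it assumes a simple additive model
in which each score is a weighted sum of a trait component, a method-linked
bias shared by every trait measured with that method, and random noise.
The weights, the sample size, and all function names are hypothetical. With
a large method weight, two different traits measured by the same method
correlate highly, while the same trait measured by two methods barely
converges.

```python
import random


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def simulate(n_subjects=2000, trait_weight=0.3, method_weight=0.8, seed=0):
    """Generate scores for every trait-method pair under the assumed model."""
    rng = random.Random(seed)
    traits = ["influence", "affect", "activity"]
    methods = ["self_rating", "peer_rating"]
    scores = {(t, m): [] for t in traits for m in methods}
    for _ in range(n_subjects):
        true = {t: rng.gauss(0, 1) for t in traits}   # "true" components
        bias = {m: rng.gauss(0, 1) for m in methods}  # method-linked error
        for t in traits:
            for m in methods:
                noise = rng.gauss(0, 0.5)
                scores[(t, m)].append(
                    trait_weight * true[t] + method_weight * bias[m] + noise
                )
    return scores


scores = simulate()
# Same trait, different methods: the convergent validity coefficient.
convergent = pearson(scores[("influence", "self_rating")],
                     scores[("influence", "peer_rating")])
# Different traits, same method: inflated by shared method variance.
monomethod = pearson(scores[("influence", "self_rating")],
                     scores[("affect", "self_rating")])
print(f"convergent (same trait, different method): {convergent:.2f}")
print(f"heterotrait-monomethod (same method):      {monomethod:.2f}")
```

Raising `trait_weight` relative to `method_weight` reverses the pattern,
which is the outcome a multitrait-multimethod matrix is designed to detect.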

Even when a completely-crossed multitrait-multimethod design
is not feasible, having the goal of multiple measures can be beneficial
to the researcher. If an investigator adopts the goal of developing
multiple measures for each of his concepts, he is likely to clarify the
concepts (traits, behaviors) being measured considerably. Specifically,
he denies himself the use of "traits" for which there is only one possible
measure. When only one method is (or can be) used to measure a trait
over a series of experiments, the trait-method complex becomes the unitary
construct. The trait thus tends to reflect some of the error peculiar
to  the  particular  measurement  method  with  which  it  is  yoked. 

It may be charged that there are some concepts which can be
measured by only one method. Perhaps self reports are the only method
which can assess certain variables dealing with psychological experiences.
If so, such variables are eternally method-bound. There is no way
in which the error pattern peculiar to the measurement method can be
separated from the trait always measured thereby. Such concepts are
always method-bound, and hence limited in the generalizability of
results obtained from their use.

Thus, while use of multiple methods may not always be feasible,
it is always desirable. Multiple methods provide a means of separating
method (error) variance from trait ("true") variance. Besides, the
implementation of this approach in specific experiments can stimulate
the investigator to develop methodologically and conceptually "clean"
constructs.

Gains  from  a  Multi-Occasion  Approach 

The gains from a multi-occasion approach are numerous. The
most obvious is that changes in behaviors through time may be observed.
The call for this type of knowledge, as noted in Chapter I, has come
repeatedly from many sources.

Another  kind  of  gain  from  the  multi-occasion  strategy  deserves 
mention,  however.  Consider  that  traditional  nomothetic  research  has 
tended  to  measure  behaviors  over  a  large  number  of  subjects  on  a 
single  occasion.  In  part,  this  tendency  must  be  due  to  the  convenience 
of  single-occasion  studies  and  to  the  availability  of  statistical  power 
for  this  type  of  design.  It  has,  in  the  process,  tended  to  concentrate 
on  traits  which  are  common  to  large  numbers  of  people,  rather  than  traits 
which  tend  to  identify  individuality. 

In  contrast,  the  idiographic  approach  has  pressed  for  study  of 
the traits which identify individuality. In so doing, the idiographic
approach has tended to conduct multi-occasion studies of single individuals.
What has been lacking is the statistical power to handle many of the data
collected idiographically.

Now,  one  can  manipulate  these  concepts  in  an  interesting  way. 
For example, the traditional single-occasion, large population study
could be viewed as an idiographic study of an occasion. With this
view, the occasion is the object being studied and the people are a
series of sample points. In contrast, the study of a single individual
over time can be considered idiographic with respect to population, but
nomothetic with respect to occasion. The individual is the object
being studied, and the occasions of measurement are the sample points.

It follows that if a multitrait-multimethod matrix can be
constructed for an occasion, as has been done for each of several
occasions in the present study, one could be constructed for an
individual person. The matrix, with traits nested inside the methods,
would take exactly the same form as Tables 9-24. Convergent and
discriminant validity in the matrix would point to valid methods of
measurement for that individual (over a series of time-ordered
occasions).
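
The two ways of slicing the same body of data can be sketched briefly. The
example below is an illustration only, using a tiny made-up data set: each
person has a list of occasions, and each occasion holds a score on two
hypothetical measures, `a` and `b`. One function correlates the measures
across persons at a fixed occasion (the conventional single-occasion
study); the other correlates them across occasions for a fixed person (the
individual matrix proposed above). The function and variable names are
assumptions introduced for this sketch.

```python
def pearson(x, y):
    """Pearson product-moment correlation of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((p - mx) * (q - my) for p, q in zip(x, y))
    sxx = sum((p - mx) ** 2 for p in x)
    syy = sum((q - my) ** 2 for q in y)
    return sxy / (sxx * syy) ** 0.5


def across_persons(cube, occasion, var_a, var_b):
    """Occasion is the object of study; persons are the sample points."""
    xs = [person[occasion][var_a] for person in cube]
    ys = [person[occasion][var_b] for person in cube]
    return pearson(xs, ys)


def across_occasions(cube, person, var_a, var_b):
    """Person is the object of study; occasions are the sample points."""
    xs = [occ[var_a] for occ in cube[person]]
    ys = [occ[var_b] for occ in cube[person]]
    return pearson(xs, ys)


# Made-up data: 3 persons by 4 occasions. Measure "a" rises over occasions
# and "b" falls, while both rise with the person's general level.
cube = [
    [{"a": p + o, "b": p - o} for o in range(4)]
    for p in range(3)
]

print(across_persons(cube, occasion=0, var_a="a", var_b="b"))
print(across_occasions(cube, person=0, var_a="a", var_b="b"))
```

In this contrived case the two measures agree perfectly across persons at
any one occasion and disagree perfectly across occasions within any one
person, which shows why a matrix built for an occasion need not describe
any individual.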

It would seem useful to pursue such a strategy whenever one
was interested in obtaining valid indices of behavior for a specific
individual. It might also be a useful technique for identifying
subsets of individuals for whom a certain measurement method yields
valid results. Above all, this would appear to be a useful strategy
for those areas of behavioral science which deal with very large social
systems (organizations, total societies), for which only one or a few
"cases" exist, but for which comprehensive measurements over time can
be obtained. While data from the present study are not explicitly
discussed from the above perspective here, such analyses will be pursued
in future explorations of these data.

Summary 

This has been a methodological study, designed to show the
value of a systematic comprehensive measurement approach to research
on small groups. The main themes of the study have been (1) the value
of using multiple measurement methods and (2) the value of measurement
over a series of time-ordered occasions.

The results presented are a selected subset of the total array
of results. Their selection was designed to show the power of a
systematic body of data. These results have been discussed in two
ways. The discussion in Chapter IV shows how some more limited research
designs may lead to interpretive problems or errors which could be
avoided. Chapter V looks at specific methods and traits studied and
considers the advantages of the present approach to research on small
groups.


LIST  OF  REFERENCES 


Altman, I., & Haythorn, W. W. Interpersonal exchange in
isolation. Sociometry, 1965, 28, 411-426. (Republished: Defense
Documentation  Center,  AD66209l77 

Appley, M. H., & Trumbull, R. Psychological stress. New
York: Appleton-Century-Crofts, 1967.

Argyle, M., & Kendon, A. The experimental analysis of social
performance. In L. Berkowitz (Ed.), Advances in experimental social
psychology. Vol. 3. New York: Academic Press, 1967. Pp. 55-99.

Bales, R. F. A set of categories for the analysis of small
group interaction. American Sociological Review, 1950, 15, 257-263. (a)

Bales, R. F. Interaction process analysis: A new method for
the study of small groups. Reading, Mass.: Addison-Wesley, 1950. (b)

Bales, R. F., & Gerbrands, H. The interaction recorder.
Human Relations, 1948, 1, 456-463.

Bales, R. F., & Slater, P. E. Role differentiation in small
decision-making groups. In T. Parsons & R. F. Bales (Eds.),
Family, socialization, and interaction process. Glencoe, Ill.: Free
Press, 1955. Pp. 259-306.

Bales, R. F., & Strodtbeck, F. L. Phases in group problem-solving.
Journal of Abnormal and Social Psychology, 1951, 46, 485-495.
In W. Vinacke, R. Wilson, & M. Meredith (Eds.), Dimensions of social
psychology. Chicago: Scott, Foresman, 1964. Pp. [illegible].

Borgatta, E. F., Cottrell, L. S., & Mann, J. M. The spectrum
of individual interaction characteristics: An interdimensional
analysis. Psychological Reports, 1958, 4, 279-319.

Campbell, D. T. Recommendations for APA test standards
regarding construct, trait, or discriminant validity. American
Psychologist, 1960, 15, 546-553.

Campbell, D. T., & Fiske, D. W. Convergent and discriminant
validation by the multitrait-multimethod matrix. Psychological Bulletin,
1959, 56, 81-105.

Carter, L. F. Some research on leadership in small groups.
In H. Guetzkow (Ed.), Groups, leadership and men: Research in human
relations. Pittsburgh: Carnegie Press, 1951. Pp. 146-157.




Cartwright, D., & Zander, A. (Eds.) Group dynamics. (2nd ed.)
New York: Row, Peterson, 1960.

Deutsch, M. An experimental study of the effects of cooperation
and competition upon group process. Human Relations, 1949, 2, 199-231.

Dewey, J. How we think. Boston: Heath, 1910.

Dulany, D. E. Awareness, rules, and propositional control:
A confrontation with S-R behavior theory. In T. Dixon & D. Horton (Eds.),
Verbal behavior and general behavior theory. New York: Prentice-Hall,
1960.

Faucett, R. E., & Newman, P. P. Operation hideout: Preliminary
report. United States Navy Medical Research Laboratory Report, 12
(13, Whole No. 228), iii. New London, Conn., July, 1953.

Firth, R. Elements of social organization. Boston: Beacon, 1963.

Garner, W. R. Context effects and the validity of loudness
scales. Journal of Experimental Psychology, 1954, 48, 218-224.

Garner, W. R., Hake, H. W., & Eriksen, C. W. Operationism
and the concept of perception. Psychological Review, 1956, 63, 149-159.

Hackman, J. R. Effects of task characteristics on group
products. Technical Report No. 5, Air Force Office of Scientific
Research, 1966.

Hackman, J. R., Jones, L. E., & McGrath, J. E. A set of
dimensions for describing the general properties of group-generated
written passages. Psychological Bulletin, 1967, 67, 379-390.

Hase, H. D., & Goldberg, L. R. Comparative validity of
different strategies of constructing personality inventory scales.
Psychological Bulletin, 1967, 67, 231-248.

Haythorn, W. W., & Altman, I. Personality factors in isolated
environments. In M. H. Appley & R. Trumbull (Eds.), Psychological
stress. New York: Appleton-Century-Crofts, 1967. Pp. 363-386.

Haythorn, W. W., Altman, I., & Myers, T. J. Emotional
symptomatology and subjective stress in isolated pairs of men. Journal of
Experimental Psychology, 1966, 1, 290-305.

Heinicke, C., & Bales, R. F. Developmental trends in the
study of small groups. Sociometry, 1953, 16, 7-38.

Hemphill, J. K., & Westie, C. M. The measurement of group
dimensions. Journal of Psychology, 1950, 29, 325-342.

LaForge, R. Research use of the ICL. Oregon Research Institute
Technical Report No. 4, Eugene, Oregon, 1963.




Leach, E. R. Political systems of highland Burma. London:
London School of Economics and Political Science, 1954.

Leary, T. F. Interpersonal diagnosis of personality. New York:
Ronald Press, 1957.

Levine, A. S. Habitability and motivation as related to the
Polaris submarine. United States Navy BuPers, Personnel Research
Division Report, FBM-11, December, 1958.

Lippitt, R. Training in community relations: A research
exploration toward new group skills. New York: Harper, 1949.

McGrath, J. E. A descriptive model for the study of
interpersonal relations in small groups. Journal of Psychological Studies,
1963, 14, 89-116.

McGrath, J. E., & Altman, I. Small group research: A
synthesis and critique of the field. New York: Holt, Rinehart, &
Winston, 1966.

Murphy, L. B. Social behavior and child personality: An
exploratory study of some roots of sympathy. New York: Columbia
Press, 1937.

Nadel, S. F. The theory of social structure. New York: Free
Press, 1964.

Newcomb, T. M. The acquaintance process. New York: Holt,
Rinehart, & Winston, 1961.

Orne, M. T. On the social psychology of the psychological
experiment: With particular reference to demand characteristics and
their implications. American Psychologist, 1962, 17, 776-783.

Osgood, C. E., Suci, G. J., & Tannenbaum, P. H. The measurement
of meaning. Urbana: University of Illinois Press, 1957.

Parten, M. Social play among preschool children. Journal of
Abnormal and Social Psychology, 1933, 28, 136-147.

Pitch, T. C. Report on the effects of prolonged snorkelling on
the [illegible] of the officers and men and on the general habitability
of the Guppy-Snorkel submarine USS Trumpetfish (SS425). United States
Navy Medical Research Laboratory Report No. 13[?], New London, Conn.,
April, 1948.

Schaefer, E. S. Converging conceptual models for maternal
behavior and for child behavior. Paper presented at the Second Annual
Conference of the Social Science Institute, Washington University, St.
Louis, 1960.

Schutz, W. C. FIRO: A three-dimensional theory of interpersonal
behavior. New York: Holt, Rinehart, & Winston, 1958.


Schutz, W. C. The ego, FIRO theory and the leader as completer.
In L. Petrullo & B. Bass (Eds.), Leadership and interpersonal
behavior. New York: Holt, Rinehart, & Winston, 1961.

Scott, W. A. Social desirability and individual conceptions of
the desirable. Journal of Abnormal and Social Psychology, 1963, 67,
574-585.

Shaw, M. E. Communication networks. In L. Berkowitz (Ed.),
Advances in experimental social psychology. Vol. 1. New York:
Academic Press, 1964.

Sommer, R. Studies in personal space. Sociometry, 1959, 22,
247-260.

Sommer, R. Leadership and group geography. Sociometry, 1961, 24,
99-109.

Sommer, R. The distance for comfortable conversation. Sociometry,
1962, 25, 111-116.

Sommer, R. Further studies of small group ecology. Sociometry,
1965, 28, 337-348.

Smith, W. M. Observations over the lifetime of a small isolated
group: Structure, danger, boredom, and vision. Psychological Reports,
1966, 19, 475-514.

Talland, G. A. Task and interaction process: Some characteristics
of therapeutic group discussion. Journal of Abnormal and
Social Psychology, 1955, 50, 105-109.

Theodorson, G. A. Elements in the progressive development of
small groups. Social Forces, 1953, 311-320.

Tuckman, B. W. Developmental sequence in small groups.
Psychological Bulletin, 1965, 63, 364-399.

Webb, E. J., Campbell, D. T., Schwartz, R. D., & Sechrest, L.
Unobtrusive measures. Chicago: Rand McNally, 1966.

Weybrew, B. B. Psychological and physiological effects of
long periods of submergence: Analysis of data collected during a
265-hour completely submerged habitability cruise made by the USS
Nautilus (SSN571). United States Naval Medical Research Laboratory
Report No. 16 (3, Whole No. 281), iv. New London, Conn., February, 1957.

Weybrew, B. B. Psychological problems of prolonged marine
submergence. In N. Burns, R. Chambers, & E. Hendler (Eds.), Unusual
environments and human behavior: Physiological and psychological problems
of man in space. New York: Free Press, 1963. Pp. 87-125.

Weybrew, B. B. Patterns of psychophysiological response to
military stress. In M. H. Appley & R. Trumbull (Eds.), Psychological
stress. New York: Appleton-Century-Crofts, 1967.




APPENDIX


[Observer recording sheet. The numbered grid of observation intervals is
not reproduced. Row labels: "Talking" and "Reciprocal looking while
smiling."]

Session: I  II  III  IV     Group # ____     Date ____     Hour ____
Room ____     Film ____


During the first 5 seconds of each minute, as signalled by the
Hunter timers:

1) take three (3) photos, spaced evenly over the 5 seconds,

2) observe who is talking while you take the photos.

When the signal tells you the 5 seconds are over, record who was
speaking by placing a check in the appropriate box. If more than
one person speaks during the 5 seconds, record them in the order
of speaking (1, 2, 3).

Take one blank frame (hand over lens) between each task (1st hour) or
between each 15-minute segment (2nd hour) and note this blank frame.
Take 3 blank frames (hand over lens) at the beginning of each hour.


DIRECTIONS: This sheet contains a list of descriptive words and phrases which can be used to describe people.

Mark those adjectives which are generally descriptive of


FORM 2-g

OBSERVER POST-TASK RATING FORM (Group)

Task: 1 ____  2 ____          Order: 1 ____  2 ____  3 ____  4 ____
Group # ____     Date ____     Observer ____
SUBJECT: A ____  B ____  C ____  D ____

Rate each of the four group members on each of the following scales. For
each man (A, B, C, D) place his letter in the appropriate box. More than one
man may appear in any given box. But each man should appear only once on
each scale.


Obstructed group's
task performance


Never talked


Never moved hands
or body


Bored with group's
activity


Submissive in group;
accepted influence
from others


Showed dislike
for others


Seemed disliked
by others

NlM  PLACE  A  CHCC*  aOMCJMt  ON  THESE  LINES  TO  INDICATE  THE 
RATING  rOU  jrfOULD  GIVE  T’«L  G R Oil • '  AS  A  DHOLE 


Pooled  resources 
smoothly  in  task 
performance 

Active  pace:  Hign 
tempo  of  grcii 
activity  (taik 
relevant  or  not  ) 


1 

1 

0 

1 

3 

M 

Confusion  in 
task  performance 


slowness  as  a 
group;  low 
activity  level 




Page  2 
Form  2-g 
DIRECTIONS: Note the roles played by each member of the group.
Place a member's letter (A,B,C,D) in each space, according to the
role(s) played. More than one letter is allowed in each space,
and any member may be listed more than once.


Leader _ 

Scribe/recorder _ 

Inactive  Member _ 

Normal/Active  Member 


FORM  S-2-g  (Group) 

SUBJECT  POST-TASK  RATINGS 
The four members of your group are: (circle your own letter)

A 


B 


C 


D 


Evaluate  yourself  and  EACH  OTHER  MEMBER  of  your  group  on  the  scales  below. 

Place each man's letter in one of the boxes on each scale. More than one
letter may appear in any one box. Consider ONLY the task you have just completed.


Obstructed  groups 
task  performance 


Never  talked 


Never  moved  hands 
or  body 


Bored  with  gps. 
activity 


Submissive in
group; accepted
influence from
others

Showed  dislike 
for  others 


Seemed  disliked 
by  others 




NOW  PLACE  A  CHECK  SOMEWHERE  ON  THESE  LINES  TO  INDICATE  THE 
RATING  YOU  WOULD  GIVE  THE  GROUP  AS  A  WHOLE 


Pooled  resources 
smoothly  in  task 
Performance 

Active  pace;  high 
tempo  of  group 
activity  (task 
relevant  or  not) 


Confusion  in 
task  performance 


Slowness  as  a 
group;  low  activity 
level 


Relationship to the Study of Rosemary Lowe


The present study is part of a larger program of research on
psychological stress, with Dr. Joseph E. McGrath as principal investigator.
The role of the present study within this larger program is to explore
relevant methodological problems and to develop a low-inference
observational method for the study of small groups under stress.

This study has a particularly close relationship to a study
carried out simultaneously by Rosemary Lowe. As described in Chapter II,
Ss for the present study were recruited from Lowe's subject population.
Ss in the present study were compared against Lowe's total population
on the two dimensions of the ICL (Dominance and Affect) and on
activity level as measured by the number of items endorsed on the ICL.
Results of these tests are presented in Table 40. No significant
differences were found. From the results of the Interpersonal Adjective
Checklist (ICL) it appears that a representative subset of Lowe's Ss
was recruited.

TABLE 40

TEST OF DIFFERENCES BETWEEN
LOWE'S SUBJECTS AND STAPERT'S SUBJECTS
ON THE INTERPERSONAL ADJECTIVE CHECKLIST


              Lowe's (N=369)      Stapert's (N=59)
Trait            M       SD           M        SD        t      p

Dominance       1.68     3.3         1.61      4.5       .11    ns
Affect          -.80     4.1        -2.40      ?.6      1.57    ns
Activity       16.?0     6.1     [illegible]   5.4      1.79    ns
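
The t values in Table 40 can be checked from the reported summary statistics. The text does not say which form of the t test was used, so the sketch below assumes the unpooled (separate-variance) standard error; the function name is hypothetical. Under that assumption the Dominance row (M = 1.68, SD = 3.3, N = 369 against M = 1.61, SD = 4.5, N = 59) reproduces the tabled value of .11.

```python
import math


def t_from_summary(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Two-sample t statistic computed from means, SDs, and sample sizes.

    Assumption (not stated in the source): the standard error is the
    unpooled form sqrt(sd_a^2 / n_a + sd_b^2 / n_b).
    """
    standard_error = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_a - mean_b) / standard_error


# Dominance row of Table 40: Lowe's Ss versus Stapert's Ss.
t_dominance = t_from_summary(1.68, 3.3, 369, 1.61, 4.5, 59)
print(round(t_dominance, 2))  # 0.11
```

The Affect and Activity rows are not recomputed here because some of their entries are illegible in the table.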
Both studies used Hackman's tasks. The Lowe study used only
twelve of the tasks, four of each task type, while the present study
used all 108 tasks (36 of each type). The 12 tasks used by Lowe were
used in the present study as the last two tasks of each of the two
overload hours.

In  addition  to  the  common  subject  population  and  common 
tasks,  there  were  other  ties  between  the  two  studies.  The  same  set 
of  trained  assistants  acted  as  observers  in  the  experiment.  And  the 
judges  who  rated  written  products  were  jointly  trained  and  employed  for 
rating  products  from  both  studies. 

These connections between the two studies have provided relationships
which allow comparisons between the studies. At the same time,
it was possible to collect distinct bodies of data which could be
analyzed separately and independently for each study.


Note  on  3-man  vs.  4-man  Groups 


Both 3-man and 4-man groups are included in the present study.
The original intent was to study only 4-man groups. The 3-man groups
were accepted only where it was impossible to schedule a 4-man group.
In fact, all of the experimental groups were originally scheduled as
4-man groups. At least two attempts were made to have all four members
present at the first session. If both efforts failed, the group was
run as a 3-man group throughout the experiment.

Some 4-man groups had difficulty maintaining their full membership
for all four experimental sessions. In spite of numerous telephoned
reminders and other efforts to keep all Ss who began the experiment, some
groups lost members. These experiences were some indication of why
multiple-occasion research on groups is seldom done.

The numerical differences between 3-man and 4-man groups raised
some problems for data-handling and statistical analysis. The scores
which required re-scaling before analysis were peer ratings of all
traits and event records of influence and affect. The peer ratings of
a subject were always averaged before statistical analysis. Thus, in a
3-man group, the sum of the peer ratings required division by two and in
4-man groups required division by three. This was a simple adjustment
to make.

Adjustment of the event records was a bit more complicated.
Analysis of the scoring procedures showed that equivalent behaviors in
3- and 4-man groups required 3-man influence scores to be multiplied by
1.50 and the 3-man affect scores to be multiplied by 1.25. This made
scores from 3-man and 4-man groups numerically comparable.
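
The two adjustments can be sketched in a few lines. This is only an illustration of the arithmetic described in the text: the function and constant names are hypothetical, and the sample scores are made up for demonstration.

```python
# Illustrative sketch only: names are hypothetical, not from the study.

# Multipliers stated in the text for 3-man event-record scores.
EVENT_FACTORS_3MAN = {"influence": 1.50, "affect": 1.25}


def mean_peer_rating(ratings_from_peers):
    """Average the ratings one subject received from the other members.

    A 3-man group supplies two peer ratings and a 4-man group three, so
    dividing by the number of raters puts both on the same scale.
    """
    return sum(ratings_from_peers) / len(ratings_from_peers)


def rescale_event_score(score, trait, group_size):
    """Put a 3-man event-record score on the 4-man scale.

    Only influence and affect are rescaled; activity event records and
    all 4-man scores pass through unchanged.
    """
    if group_size == 3:
        return score * EVENT_FACTORS_3MAN.get(trait, 1.0)
    return score


print(mean_peer_rating([4, 6]))                  # 3-man group: two raters
print(mean_peer_rating([4, 6, 5]))               # 4-man group: three raters
print(rescale_event_score(2.0, "influence", 3))  # 2.0 * 1.50
print(rescale_event_score(2.0, "affect", 3))     # 2.0 * 1.25
```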

However,  there  is  a  more  fundamental  question  involved  than  the 
re-scaling.  That  is,  does  group  size  play  a  part  in  eliciting  different 
levels  of  influence,  affect,  or  activity  in  the  group?  This  question 
was  answered,  in  part,  by  comparing  mean  scores  from  3-  and  4-man  groups 
on  each  variable  (each  trait  measured  by  each  method).  Table  41 
presents  these  comparisons. 

Of the 15 comparisons, 14 are not significantly different. The
event records measure of affect shows a small (.05 unit) difference
which is barely statistically significant (p < .05). This sole significant
difference seems as likely to have been caused by a slight
under-correction in the re-scaling procedures as by any real difference between
the two group sizes. The indication from these comparisons, therefore,
is that 3-man and 4-man groups did not differ in the elicitation of the
measured behaviors.
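
As a rough supplementary check, not part of the original analysis, one can ask how often a single significant result would turn up among 15 comparisons by chance alone. The sketch below assumes the 15 tests are independent and each run at the .05 level; the tests here share subjects, so the figures are only approximate.

```python
# Assumptions (not from the source): independent tests, alpha = .05 each.
n_comparisons = 15
alpha = 0.05

# Expected number of chance "significant" results among the 15 tests.
expected_false_positives = n_comparisons * alpha

# Probability that at least one test reaches significance by chance.
p_at_least_one = 1 - (1 - alpha) ** n_comparisons

print(expected_false_positives)   # 0.75
print(round(p_at_least_one, 2))   # 0.54
```

On these assumptions, one significant difference in 15 is about what chance would produce, which is consistent with treating the two group sizes as equivalent.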

It is possible that an interaction effect was present between group
size and time or stress, and any such interaction would be obscured by the
above analysis. However, if there were strong differences between 3-man and
4-man groups, some main effects (shown not to have occurred) would have
been expected.

The product ratings in the present study (combined from 3-man and
4-man groups) matched Hackman's (3-man only) product scores very well. Had
the 4-man groups differed systematically on the products, such a good match
would have been extremely unlikely.

So, the data suggest that on the behaviors and performances measured
in the present study, no significant differences exist between 3-man and
4-man groups. The combination of data from the two sizes of groups
therefore seems legitimate.
TABLE 41


COMPARISONS OF 3-MAN AND 4-MAN GROUPS


                            3-man Groups           4-man Groups
Method         Trait        Mean     Var.          Mean     Var.           t            p

Self           Influence    4.99     0.47          5.01     0.56           0.19         ns
Ratings        Affect       5.02     0.91          5.36     0.68           1.24         ns
               Activity     4.43     0.89          4.64     0.59           0.03         ns

Peer           Influence    5.01     0.46          5.05     0.22           0.26         ns
Ratings        Affect       5.12     0.67          5.31     0.13           1.12         ns
               Activity     4.35     0.78          4.61     0.01           1.52         ns

Obs.           Influence    4.46     0.08          4.76     0.75           1.27         ns
Ratings        Affect       4.54     0.25          4.72     0.21           1.44         ns
               Activity     4.07     1.47          4.56     0.48           1.06         ns

Test           Influence    1.16    17.27          1.40    22.13           0.24         ns
(ICL)          Affect      -1.99    60.57         -1.64    53.8?       [illegible]      ns
               Activity    30.93  [illegible]     27.75  [illegible]   [illegible]      ns

Event          Influence    0.00     0.?3          0.00     0.50           0.00*        ns
Records        Affect       0.11     0.0?          0.16     0.01           2.05       <.05
               Activity     1.25     0.12          1.??     0.09           0.?4         ns

*This mean value is necessarily 0.00 for groups.


VITA 


John Charles Stapert


BORN September 25, 1942, Kalamazoo, Michigan
MARITAL  STATUS  Married  (1  child) 

EDUCATION 

1948-1956  North Christian Elementary School, Kalamazoo, Michigan

1956-1960  Kalamazoo Christian High School, Kalamazoo, Michigan

1960, '61, '62  Western Michigan University, Kalamazoo, Michigan
(summers only)

1960-1963  Hope College, Holland, Michigan

1963-1966  Fuller Theological Seminary, Pasadena, California

1966-1969  University of Illinois, Urbana, Illinois

DEGREES

A.B. - 1963  Hope College, Holland, Michigan. Major: Psychology

B.D. - 1966  Fuller Theological Seminary, Pasadena, California

M.A. - 1968  University of Illinois, Urbana, Illinois. Major: Psychology

Ph.D. - 1969  University of Illinois, Urbana, Illinois.

Major: Personality (Psychology)

Minor: (split) Social Psychology
Sociology

PUBLICATIONS 

Attraction and Disagreement-Produced Arousal. With Gerald Clore, Jr.
Journal of Personality and Social Psychology (in press).

EXPERIENCE 

1962-1963  Student Discussion Assistant
Department of Psychology, Hope College

1964-1966  Teaching Assistant in Hermeneutics
Fuller Theological Seminary

1966-1968  Teaching Assistant
Department of Psychology, University of Illinois

1967 (summer)  Research Assistant, University of Illinois
(Dr. Frank Costin)

1968-1969  Research Assistant, University of Illinois
(Dr. Joseph E. McGrath)

PROFESSIONAL  AND  HONORARY  SOCIETIES 

American Psychological Association (Professional)

Christian Association for Psychological Studies (Professional)

Eta Sigma Phi (Classics) (Honorary)

Pi Kappa Delta (Forensics) (Honorary)




